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1.  INTRODUCTION 

.  A  major  problem  in  the  application  of  control  theory  Is  the  fact 
that  any  mathematical  model  used  for  design  or  analysis  is  only  an  approxi¬ 
mation  to  the  true  physical  system.  The  error  between  the  mathematical 
model  and  the  system  itself  (or  plant)  has  many  causes.  For  example,  coef¬ 
ficients  that  are  assumed  constant  in  the  model  may  in  reality  be  time 
varying.  Also,  the  system  may  be  too  complex  to  be  accurately  described  by 
a  mathematical  model  that  is  feasible  for  design  and  analysis  purposes. 

This  is  usually  the  case  since  a  feasible  model  is  typically  restricted  to 
be  linear,  time  invariant,  and  finite  dimensional. 

Consequently,  it  is  important  to  be  able  to  design  a  controller  so 
that  the  plant  model-controller  system  (or  nominal  system)  is  as  tolerant  to 
plant  variations  as  possible.  This  tolerance  is  important  in  two  ways. 

First,  the  nominal  system  must  remain  stable  under  the  expected  range  of 
plant  variations.  This  system  quality  is  often  referred  to  as  the  robust¬ 
ness  of  the  system.  Second,  the  input  output  (I/O)  characteristics  of  the 
nominal  system  should  be  as  insensitive  as  possible  to  these  variations.  The 
nominal  system  is  said  to  have  good  sensitivity  properties  if  the  effect  of 
these  variations  on  the  I/O  response  is  reduced  from  the  effect  on  the  plant 
alone . 

The  field  of  control  theory  is  based  upon  the  fact  that  feedback 
can  be  used  to  improve  these  and  other  important  system  qualities  [1],[2]. 

For  example,  the  effect  of  uncontrollable  plant  disturbances  on  the  output 
of  the  plant  can  be  reduced  by  feedback.  Feedback  controllers  can  also  be 


used  to  achieve  a  desirable  I/O  response,  although  this  in  itself  does  not 
necessarily  require  feedback  [3]. 

The  synthesis  of  closed  loop  controllers  for  single-input  single¬ 
output  (SISO)  systems  with  these  properties  is  well  understood.  Root  locus 
plots,  Nyquist  diagrams,  and  other  classical  frequency  domain  techniques  are 
readily  used  for  design.  The  Nyquist  diagram  is  particularly  helpful  since 
the  distance  from  the  Nyquist  locus  to  the  critical  point  (called  the  return 
difference  function)  provides  a  measure  of  the  disturbance  rejection  and 
command  following  properties  as  a  function  of  frequency.  In  addition,  when 
the  plant  uncertainties  are  modeled  by  additive  perturbations  the  return 
difference  function  is  also  a  measure  of  robustness  and  sensitivity.  When  the 
uncertainties  are  modeled  by  multiplicative  perturbations,  the  distance  of 
the  inverse  Nyquist  locus  from  the  critical  point  provides  this  measure. 

Unfortunately,  these  well-tested  design  techniques  for  SISO  systems 
do  not  have  an  easy  generalization  to  multiple-input  multiple-output  (MIMO) 
systems.  Several  problems  unique  to  MIMO  systems  arise.  For  example,  mani¬ 
pulations  with  transfer  function  matrices  are  more  difficult  since  matrix 
multiplication  is  not  commutative.  Thus  robustness  and  sensitivity  margins 
are  dependent  on  where  the  perturbation  is  inserted  in  the  loop.  Also,  the 
perturbations  not  only  have  a  fequency  dependence,  but  a  spatial  dependence 
as  well.  In  other  words,  uncertainties  may  occur  only  in  certain  loops. 

Despite  these  problems,  some  useful  generalizations  of  the 
classical  theory  to  MIMO  systems  have  recently  been  made  [4-7].  These 
generalizations  involve  the  return  difference  matrix  or  the  inverse  Nyquist 
matrix  (these  are  the  MIMO  counterparts  of  the  scalar  functions) .  For 
example,  when  the  return  difference  matrix  of  a  nominally  stable. 


additively  perturbed  system  is  singular  at  some  frequency,  it  means  that  a 
zero  pair  of  the  characterisitc  equation  lies  on  the  imaginary  axis .  Thus 
for  all  stable  additive  perturbations  that  are  smaller  than  the  smallest 
perturbation  that  causes  singularlity ,  the  system  will  be  stable.  Hence  the 
distance  of  the  return  difference  matrix  from  singularity  provides  a  measure 
of  the  stability  margin  of  the  system  under  stable  additive  perturbations. 
This  is  a  direct  generalization  of  the  SISO  case  where  the  distance  of  the 
Nyquist  locus  from  the  critical  point  represents  the  degree  of  stability, 
which  is  measured  by  the  classical  gain  and  phase  margins.  Furthermore,  it 
can  be  shown  that  the  return  difference  and  inverse  Nyquist  matrices  also 
reflect  the  sensitivity,  disturbance  rejection,  and  I/O  properties  of  MIMO 
systems . 

When  the  standard  Euclidean  coordinates  and  corresponding  norm  are 
used  to  describe  the  system,  this  distance  from  singularity  can  be  conve¬ 
niently  computed  in  terms  of  matrix  singular  values.  Several  singular  value 
inequalities  for  robustness  and  sensitivity  under  additive  or  multiplicative 
perturbations  are  available  [8]. 

The  design  requirements  on  robustness,  sensitivity,  disturbance 
rejection  and  I/O  response  translate  into  bounds  on  these  singular  value 
curves  (as  functions  of  frequency).  Thus  a  major  design  objective  is  to 
synthesize  a  controller  that  adjusts  the  singular  value  curves  of  the  return 
difference  or  inverse  Nyquist  matrix  into  some  desirable  shape.  If  the 
mathematical  structure  of  the  controller  is  predetermined,  it  is  possible 
to  calculate  the  gradient  of  these  singular  values  with  respect  to  the 
adjustable  parameters  of  the  controller  (or  control  parameter  vector) .  The 
gradient  will  be  a  function  of  the  control  parameter  vector  and  frequency. 


Given  an  initial  control  parameter  vector  and  a  constant  frequency,  the 
gradient  indicates  how  to  change  the  initial  control  parameter  vector  to 
have  the  greatest  possible  effect  on  the  singular  value  curve  at  that 
frequency . 

In  order  to  deform  the  entire  singular  value  curve  to  fit  specifi¬ 
cations,  it  will  generally  be  necessary  to  follow  an  iterative  select- 
evaluate-adjust  design  procedure.  Each  time  a  new  control  parameter  vector 
is  selected,  the  singular  values  must  be  analyzed.  If  the  singular  values 
are  not  satisfactory,  the  gradient  can  be  recalculated  at  the  frequency  of 
interest,  and  the  control  parameter  vector  readjusted.  Then  the  singular 
values  are  analyzed  again,  and  so  on. 

The  success  of  the  design  depends  on  the  capability  of  the  designer, 
to  use  the  gradient  information  along  with  his  insight  into  the  particular 
system  to  determine  how  to  adjust  the  parameter  vector  to  set  the  desired 
effect  on  the  closed  loop  system.  This  design  method  is  obviously  highly 
dependent  on  the  particular  problem  under  consideration.  However,  if  the 
designer  is  using  a  standardized  synthesis  method  at  each  design  iteration 
to  adjust  the  controller,  such  as  the  linear  quadratic  (LQ)  design  method, 
the  gradient  computations  can  be  formulated  explicitly. 

The  solution  of  the  full  state  feedback  LQ  problem  is  a  constant 
gain  feedback  matrix  that  optimizes  a  cost  functional  which  contains  weighted 
quadratic  state  and  control  terms  [9].  These  weights  are  the  choice  of  the 
designer,  and  determine  the  feedback  gain  matrix.  An  advantage  of  the  LQ 
desl  a  method  s  that  it  provides  a  systematic,  numerically  feasible  method 
for  cheesing  a  feedback  gain  matrix  with  relatively  small  gains  (if  all  of 
the  state  weights  are  set  to  zero  then  the  minimum  energy  regulator  solution 


results) .  By  scaling  these  weights  the  LQ  method  can  be  used  for  approxi¬ 
mate  pole  placement  [10].  Also,  the  LQ  loop  has  Inherently  good  robustness 
and  sensitivity  properties  [11], [12]. 

A  disadvantage  of  the  LQ  design  is  that  it  is  not  directly  obvious 
how  to  choose  these  weights  to  achieve  a  given  set  of  design  specifications. 

In  particular,  it  is  not  at  all  obvious  how  to  choose  these  weights  to  alter 
the  singular  values  of  the  return  difference  and  inverse  Nyquist  matrices, 
and  thus  affect  the  robustness  and  sensitivity  properties  of  the  system. 

In  this  thesis,  explicit  formulas  for  the  gradient  of  the  singular 
value  functionals  of  the  return  difference  and  inverse  Nyquist  matrices  are 
derived  for  the  LQ  problem.  This  gradient  information  can  be  used  iteratively 
to  tune  the  state  and  control  weights  to  shape  the  singular  values  as  a 
function  of  frequency  and  thus  obtain  desired  robustness  and  sensitivity 
properties . 

In  Section  2,  basic  properties  of  singular  values  are  reviewed, 
then  the  applications  of  singular  value  analysis  in  control  theory  such  as 
multivariable  Bode  plots  and  robustness-sensitivity  bounds  are  discussed. 

The  Lyapunov  operator  is  defined,  and  the  differential  and  gradient  concepts 
are  reviewed.  Sensitivity  formulas  for  the  eigenvalues  and  singular  values 
of  a  matrix  are  developed. 

Section  3  contains  a  discussion  on  MIMO  loop  shaping  in  the  fre¬ 
quency  domain,  and  how  desirable  system  properties  translate  into  require¬ 
ments  on  the  smallest  singular  value  of  the  return  difference  matrix  and  the 
Inverse  Nyquist  matrix.  Following  this,  general  sensitivity  formulas  for 
these  singular  value  functionals  are  derived,  and  an  iterative  design  method 


is  proposed  for  manipulating  these  functionals.  This  theory  is  then  applied 
to  the  LQ  problem  to  obtain  specific  results . 

Finally,  Section  4  presents  an  example  application  of  the  method 
developed  in  Section  3.  The  design  method  of  Section  3  and  an  asymptotic 
design  method  are  compared.  Section  5  summarizes  the  thesis  and  discusses 
further  research  possibilities . 


2.  SINGULAR  VALUES  IN  CONTROL  THEORY 


In  this  section  the  basic  background  needed  for  the  presentation 
of  Section  3  is  reviewed.  Singular  values  and  their  role  in  the  analysis  of 
robustness  and  sensitivity  of  feedback  systems  are  discussed  in  general. 
Vector  space  and  gradient  concepts  are  reviewed  as  a  preliminary  to  dif¬ 
ferential  eigenvalue  and  singular  value  formulas. 


The  Singular  Value  Decomposition  and  Near  Singular  Matrices 


In  control  theory  it  is  often  desirable  to  measure  the  nearness  to 
singularity  of  a  matrix.  The  following  theorem  provides  such  a  measure. 
Theorem  2.1:  Suppose  that  A  and  AA  are  n*n  matrices,  and  that  the  inverse  A 
exists.  Then,  the  inverse 


(A+ AA)' 


(2.1) 


exists  if 


I  AAl  <  I  A_1I -1 


(2.2) 


where  I  •!  is  the  standard  induced  Euclidean  norm.  □ 

Proof ;  [13] . 

The  matrix  AA  is  an  arbitrary  additive  perturbation.  Since  the 
inverse  in  equation  (2.1)  will  exist  for  every  perturbation  that  satisfies 
equation  (2.2),  the  magnitude  of  the  functional  fl A  'S  ^  indicates  how  sensi¬ 
tive  a  matrix  is  to  changes  in  its  entries.  For  example,  if  the  value  of 
-1  “1 

I A  I  is  small,  then  a  small  change  in  an  element  of  A  could  cause 
singularity. 

Notice  also  that  equation  (2.2)  is  only  a  sufficient  condition.  This 
means  that  the  matrix  A+AA  may  or  may  not  be  singular  for  a  given  pertur¬ 
bation  that  violates  equation  (2.2). 


A  singular  value  of  an  arbitrary  complex  valued  mxn  matrix  A  is 


denoted  by  o^(A),  and  is  defined  by 


for  each 


oi(A)  -  Ai(AHA) 


i  *  1, . . . 


where 


l  —  min{m,n}. 


(2.3) 


Here,  A  (•)  is  the  ith  eigenvalue  of  the  indicated  matrix.  Notice  that 

singular  values  are  defined  for  nonsquare  matrices  in  general.  Also,  singular 

values  are  always  nonnegative  real  numbers  since  any  positive  semidefinite 

► 

hermitian  matrix  has  eigenvalues  that  are  real  and  nonnegative. 

Theorem  2.2:  Any  matrix  A€  CmXn  of  rank  l  has  a  singular  value  decomposition 
given  by 

A  -  UEV11  (2.4) 

where  U€  Cm  ^  and  V €  Cn*^  are  unitary  matrices  and  where 


with 


Z  -  diag{o1,o2,...,oJl} 


°1  “  a2  “ 


1  a 


V 


(2.5) 

(2.6) 


Proof ;  [13],  |- 

From  equation  (2.4), 

AHAV  -  V22.  (2.7) 

Hence  the  columns  of  V  are  orthonormal  eigenvectors  of  the  hermitian  matrix 
A  A.  These  vectors  are  denoted  v^  and  are  called  the  right  singular  vectors. 
By  convention,  the  vi  in  equation  (2.7)  are  ordered  so  that  inequality  (2.6) 
is  satisfied.  Also  from  equation  (2.4), 


The  columns  of  U  are  denoted  and  are  termed  the  left  singular  vectors. 
Some  useful  properties  of  singular  vectors  are  given  below. 

Corollary  2.3: 

(1)  The  right  and  left  singular  vectors  are  related  by 


Av  ■  o  u 
1  11 

.H 

A  ui  "  Vi* 


(2.9) 


(2) 


If  A  is  invertible,  where  As  Cn*n,  then  the  singular  values  of  A  ^  are 
related  to  those  of  A  by 


a4  (a*1)  ■  - 

1  ,(A) 


,  1  ■  1, . . . ,n. 


(2.10) 


n+l-i 


(3)  The  induced  Euclidean  norm  is  given  by 


II  All  2  -  o1(A). 


(2.11) 


Proof :  [13]. 

Property  (2)  allows  Theorem  2.1  to  be  interpreted  in  terms  of 
singular  values.  Equation  (2.2)  is  equivalent  to 


0(AA)  <  o(A_1) 

where  . 

3(0  -  o1(0 

£(0  ~  0  (*)• 
—  n 


(2.12) 


(2.13) 


Inequality  (2.12)  plus  the  fact  that  there  exists  a  perturbation  AA  such 
that  0(AA)*£(A)  and  A+AA  is  singular  means  that  the  smallest  singular  value 
of  a  matrix  is  the  distance  from  that  matrix  to  the  nearest  singular  matrix. 


T 


Singular  Values  in  Control  Theo 


Suppose  that  P*(ja>)  is  a  mxn  rational  transfer  function  matrix  of 
a  multiple  input-multiple  output  (MIMO)  linear  system,  with 


y(jai)  -  P(jaj)u(jaj)  ,  y€Cn,  u€  C°. 


(2.14) 


Then,  it  has  been  shown  [14]  that  for  all  a>, 


(2.15) 


where  a(P*(j<o))  is  the  least  upper  bound  and  £(P*(jw))  is  the  greatest  lower 
bound  of  the  above  vector  ratio. 

If  we  let  d  u( j cu) 9  •  1,  then  the  output  II  y (j oo)  d  is  bounded  between 
the  two  singular  values  at  a  given  a>  for  any  input  direction.  A  frequency 
plot  of  these  singular  values  can  be  regarded  as  the  MIMO  generalization  of 
a  single  input-single  output  (SISO).  Bode  magnitude  plot  (see  Figure  2.1). 

The  interpretation  of  a  Bode  magnitude  plot  as  a  rms  sinusoidal 
power  gain  curve  also  carrys  over  to  the  MIMO  case  [15],  Suppose  that  an 


input 


u(t)  ■  u  cos  U)t 


(where  u  is  a  vector  of  constants)  is  applied  to  the  system  of  equation  (2.14) 
The  sum  of  the  mean  squared  power  of  each  input  signal  is 


,,  U  T 

oj  r  T 


SMSPI  ■  -t—  /  u  (t)u(t)dt  ■  j  uu. 


The  sum  of  the  mean  squared  power  of  the  outputs 


SMSFO  (Oy  (C)dt 


where  y  (t)  Is  a  vector  of  steady  state  (sinusoidal)  outputs.  Then,  the 
ss 

ratio  of  mean  squared  power  out  to  mean  squared  power  in  is  bounded  by  the 
squares  of  the  singular  values  of  for  every  input  direction  u 

£(P*(jw))2  <  fj§~  <  a(P*(ju>))2. 

Singular  values  have  another  extremely  important  interpretation  in 
the  robustness  analysis  of  MIMO  systems.  A  closed  loop  feedback  system  is 
said  to  be  robust  if  it  remains  stable  when  the  true  plant,  P(ju)  varys  from 
the  nominal  model  P*(ju>)  that  was  used  to  design  the  feedback  controller. 

This  difference  between  the  real  plant  and  the  mathematical  repre¬ 
sentation  can  be  modeled  in  many  ways.  Three  models  which  have  proven  useful 
for  analysis  are; 

(1)  Additive  perturbations 

P(jo))  -  P*<jo>)  +  APA(joj)  (2.16) 

where  P(ju))  is  the  true  plant,  P*(jw)  is  the  nominal  model,  and  APA(jw) 
is  an  unknown  quantity. 

(2)  Multiplicative  input  perturbations 

P(Jui)  -  P*(ju>)(I  +  APM(juj)).  (2.17) 

(3)  Multiplicative  output  perturbations 

P(ju>)  -  (I+APM(ju)))P*(ju>).  (2.18) 

Bounds  expressed  in  terms  of  singular  values  have  been  derived 
that  relate  the  nominal  plant  to  the  magnitude  of  the  perturbation  that  can  be 
tolerated  before  stability  is  no  longer  assured.  Consider  the  case  of  additive 
perturbations  (see  Figure  2.2)  where  P*(Ju>)  is  an  m*n  nominal  plant  transfer 
function,  and  K(Ja>)  is  the  n*m  controller. 


Theorem  2.4 :  Assume  : 

(1)  P*(jw),  AP^(jco) ,  and  K(joj)  are  rational  transfer  function  matrices 

(2)  APA(joi)  is  stable. 

(3)  The  nominal  closed  loop  system  (AP.(jaj)  =  0)  is  stable. 

A 

Then,  the  closed  loop  system  described  by  solid  lines  in  Figure  2  is  stable 
for  all  APA(jo>)  which  satisfy 

£(I  +  P*(ju)K(J«))  >  5(APA(ja)))5(K(ju)))  (2.19) 

for  all  u€  R. 

Proof :  [15].  C 

The  utility  of  this  theorem  lies  in  the  fact  that  only  knowledge 
of  the  norm  of  the  perturbation  matrix  is  required  to  ensure  stability. 

Equation  (2.19)  can  also  be  Interpreted  geometrically  as. a  generali 
zation  of  the  Nyquist  criterion  for  SISO  systems.  The  left  hand  side  of 
equation  (2.19)  is  analogous  to  the  distance  of  the  SISO  Nyquist  locus  from 
the  critical  point  -1+jO  in  the  complex  plane  [15]. 

A  similar  theorem  holds  in  the  case  of  multiplicative  perturbations 
Consider  the  system  in  Figure  2.3. 

Theorem  2.5:  Assume  : 

(1)  P*(ju),  AP„(1oj)  ,  and  K(jui)  are  rational  transfer  function  matrices. 

M 

(2)  APM(jco)  is  stable. 

(3)  The  nominal  closed  loop  system  (AP„(jto)  =  0)  is  stable. 

M 

(4)  det{P*(joj)K(ju)  }  t  0. 

Then  the  perturbed  system  is  stable  for  all  P^(ju)  which  satisfy 

£(I+ [P*(jw)K(ju))]'1)  >  a(APM(jo>))  (2.20) 

for  all  u  €  R  . 

Proof :  [4] .  □ 


Here,  the  robustness  is  measured  with  respect  to  the  inverse 
Nyquist  matrix  1+  [P*(ju))K(jui)  ]  ^  rather  than  the  return  difference  matrix 
as  in  equation  (2.19). 

It  is  important  to  be  aware  that  equation  (2.20)  is  valid  only 

for  perturbations  that  are  at  the  output  of  the  plant,  as  shown  in  Figure  2.3 

If  the  perturbation  was  applied  at  the  input  of  the  plant  (the  positions  of 

P*(jtu)  and  I  +  AP  (joi)  would  be  reversed  in  Figure  2.3)  equation  (2.20) 

M 

becomes 

£(I+  [K(ja))P*(ju>)]_1)  >  5(APM(ja>)) 

where  the  matrix  K(jai)P*(jai)  is  nxn  rather  than  mxm.  For  the  sake  of  con¬ 
sistency,  all  of  the  following  theorems  in  this  section  will  deal  with 
output  perturbations.  Results  for  input  perturbations  are  similar  and  can 
be  derived  from  the  references  given  for  each  theorem. 

The  above  theorem  shows  how  multiplicative  perturbations  impose 
limits  on  the  size  of  the  closed  loop  bandwidth  u>  .  (We  define  the  bandwidth 
of  a  MIMO  system  as  the  frequency  at  which  the  largest  singular  value  of  the 
transfer  function  matrix  drops  to  1//2  of  its  zero  frequency  value.)  The 
relation 

[1+  [P*(jaj)K(ja))]"13  1  -  P*(ju))K(juJ)[I  +  P*(ju))K(ja))]‘1  (2.21) 

plus  equation  (2.10)  implies  that  the  smallest  singular  value  of  the  Inverse 
Nyquist  matrix  is  just  the  reciprocal  of  the  largest  singular  value  of  the 
transfer  function  matrix.  At  frequencies  where  large  multiplicative  per¬ 
turbations  are  possible,  the  above  theorem  requires  that 
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2.(1  +  (P*(jo))K(jo))]'1) 


be  large,  and  hence 


be  small. 


a(P*(ju))K(jaO  [I  +  P*(ju)K(ja>)  ]"  ) 


The  effect  of  modeling  uncertainties  and  parameter  variations  on 


input-output  response  in  MIMO  system  can  also  be  measured  in  terms  of 
singular  values.  One  way  to  do  this  that  provides  a  logical  generalization 
of  the  SISO  case  is  to  define  a  nominally  equivalent  open  loop  system,  and 
then  deduce  conditions  that  require  the  output  of  the  closed  loop  system  to 
be  less  sensitive  to  perturbations  [5].  Consider  the  system  described  by 
solid  lines  in  Figure  2.4  where 


K  (joj)  -  K(juj)  (I  +  P*(ju))K(jw) )-1  . 
o 


(2.22) 


Comparing  this  open  loop  system  to  the  closed  loop  system  of  Figure  2.2, 
we  see  that  when  AP^(jaj)  =0  the  two  systems  will  respond  identically  for 
any  given  input  U  .  For  the  case  AP  (jw)  =  0,  y  (joe)  will  denote  the  output 

C  A  O 

of  the  open  loop  system,  and  yc  will  denote  the  output  of  the  closed  loop 
system.  When  AP  (jw)  jSO,  these  outputs  will  be  denoted  by  y'(ju)  and  y’(ju)). 
The  open  loop  and  closed  loop  errors  are  defined  by 


eQ(jw)  -  yo(jo>)  -  y^(joj) 
ec(jw)  »  yc(jw)  -  y^(jo>) 


(2.23) 


The  open  loop  and  closed  loop  systems  can  be  compared  through  the 


mean  squared  errors,  which  are  defined  to  be 


J  -  /  ie  (t)  J  at 
c  0  C 

where  e  (t)  and  e  (t)  are  the  inverse  Fourier  transforms  of  equation  (2.23). 
o  c 

Theorem  2.6:  Suppose: 

(1)  K(ju) ,  P*(ju>)  ,  and  APA(jai)  are  rational  transfer  function  matrices. 

(2)  APA(j(o)  and  (1  +  P*(j<u)K(ju>)  )-1  are  stable. 

(3)  The  perturbed  system  of  Figure  2.2  is  stable  (i.e.  APA(joj)  satisfies 
equation  (2.19)). 

If  APA(joi)  satisfies 

o(I  +  P*(joi)K(joj))  >  a(APA(ju))5(K(ju)))  +  1  (2.25) 

for  all  a)  in  some  interval  Jl,  and  u  (Iuj)  ■  0  for  all  then 

c 

J  <  J  .  (2.26) 

c  o 


Proof:  [8], 

When  the  conditions  of  this  theorem  are  satisfied,  the  output  of  the 
closed  loop  system  in  Figure  2.2  will  be  less  sensitive  to  additive  pertur¬ 
bations  than  the  output  of  the  equivalent  open  loop  system.  Under  these 
conditions,  feedback  has  a  desensitizing  effect. 

The  sensitivity  condition  (2.25)  is  remarkably  similar  to  the 
stability  condition  (2.20).  Indeed,  if  a  system  has  good  sensitivity  pro¬ 
perties  under  additive  perturbations,  it  must  necessarily  also  have  good 
stability  margins  under  additive  perturbations  in  the  frequency  band  — ^ • 

A  similar  result  holds  for  multiplicative  perturbations .  Consider 


the  multiplicatlvely  perturbed  systems  in  Figures  2.3  and  2.5,  with  errors 
defined  as  in  equation  (2.23). 


Nominally  equivalent  multiplicatively  perturbed  open 
loop  system. 


Theorem  2.7;  Suppose 

(1)  APM(joj),  P*(ju)),  and  K(jco)  are  rational  transfer  function  matrices. 

(2)  AP  (ju»)  and  (1+ P*(jui)K(ju))'1  are  stable. 

M 

(3)  AP  (jto)  satisfies  equation  (2.20). 

n 

(4)  det{P*(jto)K(jw)  }  ^  0. 

If  AP  (jto)  satisfies 

0(1+  [P*(jai)K(jm)]‘1)  >  a(APM(joj))  +  5([P*(Ju)K(ju)  J’1)  (2.27) 

for  all  u)  in  some  interval  J> ,  and  u  (joi)  *  0  for  all  J? ,  then 

c 

J  <  J  . 
c  o 

Proof :  [8],  □ 

Since  3(  (P*(ja))K(joj)  ]”^)  is  a  nonnegative  number  ,  we  again  see 
that  good  sensitivity  properties  of  a  multiplicatively  perturbes  system  imply 
good  stability  margins  (see  equation  (2.20))  for 

Also,  the  sensitivity  conditions  (2.25)  and  (2.27)  need  only  be 
satisfied  in  a  finite  bandwidth.  The  robustness  conditions  (2.19)  and 
(2.20)  must  be  satisfied  for  all  u. 

Another  important  fact  is  the  the  robustness  sensitivity  theorems 
for  additive  perturbations  (Theorems  2.4  and  2.6)  are  conservative.  For 
example,  there  does  not  necessarily  exist  a  destabilizing  APA(jo>)  which  will 
satisfy  equation  (2.19)  with  equality.  However,  if  additive  perturbations  are 
considered  in  the  configuration  indicated  by  the  dotted  line  in  Figures  2.2 
and  2.4,  the  bounds  (2.19)  and  (2.20)  are  not  conservative. 
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IT! 


.3.  Linear  Operators  and  the  Lyapunov  Equation 


It  is  easily  demonstrated  that  the  collection  of  all  m  n  complex 
valued  matrices  (<EmXn)  is  a  linear  vector  space.  A  useful  inner  product  on 


this  space  is 


<MrM2>  *  tr{^,M2>  ,  M1,M2e  €?“xn. 


(2.28) 


The  natural  norm  associated  with  this  inner  product  is  the  Frobenius  norm 


<M1,M1> 


tr{M^M1> 

I  l  m,,r 

i=l  j*l  ij 


im^f* 


(2.29) 


A  vector  space  is  complete  under  a  given  norm  if  every  Cauchy 
sequence  of  vectors  in  the  space  converges  to  a  vector  which  is  also  a  member 
of  the  space.  A  complete,  normed,  linear  vector  space  is  a  Banach  space. 

If,  in  addition,  an  inner  product  which  induces  the  norm  is  defined,  the  space 
is  a  Hilbert  space.  Under  the  inner  product  defined  in  equation  (2.28). 

Cmxn  is  a  Hilbert  space. 

A  concept  that  will  be  useful  later  is  cartesian  product  Hilbert 
space.  A  cartesian  product  of  two  Hilbert  spaces  is  a  collection  of  all 
ordered  pairs  consisting  of  one  vector  from  the  first  space  and  one  vector 
from  the  second.  For  example. 


-  A  „mxn  _mxm 
S  —  C  x  c 


is  a  cartesian  product  space.  An  element  of  this  space  is  written 


(2.30) 


F 

r.'. 

t 

m 


fe 

v- 

I  ;■ 


I 


V 

\ 


p 


r  ■ 


^  • 

C 
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An  inner  product  is  defined  on  a  Cartesian  product  space  by  adding 
the  inner  products  from  each  constituent  space.  For  example,  the  inner 
product  on  S  is 

(  (M1,N1),(M2,N2)>  -  <MrM2>  +  (N1,N2> 

-  tr{^M2>  +  tr{N^N2}.  (2.31) 

Under  this  inner  product  the  Cartesian  product  of  two  Hilbert  spaces  is 
itself  a  Hilbert  space. 

The  concepts  of  operators  and  adjoints  will  be  useful  in  the 
derivation  of  the  gradient  in  the  following  section.  Suppose  «3r: X  +lf  is  a 
linear  operator  where  and  if  are  inner  product  spaces.  If  there  exists 
another  operator  <£*  :  Tf-*&  that  satisfies 

<y,4(x)>  -  Ct^ybx)  ,  Vx€J&,  y etf.  (2.32) 

then  Jf*  is  called  the  adjoint  of  3.  When  JC-  and  if  are  Hilbert  spaces, 
the  adjoint  always  exists  and  is  unique.  The  adjoint  operator  has  the 
following  properties  [16]  : 

(1)  The  adjoint  operator^*  is  linear. 

(2)  If  J  has  an  inverse  then 

0*"1)*-  (Jf*)-1.  (2.33) 

It  will  be  of  interest  to  us  to  examine  the  Lyapunov  equation  in 
the  context  of  operators  and  adjoints.  First,  recall  the  following  funda¬ 
mental  properties  of  the  Lyapunov  equation. 

Theorem  2.10:  Consider  the  Lyapunov  equation 


KA  +  A*^  +  Q  -  0 


(2.34) 


A,K,Q€  C 


•3 
•J 

and  A  is  stable  (i.e.  Re{X^(A)}>0,  i-l,...,n).  Then 

(1)  For  any  QS  Cnxn  there  exists  a  unique  solution  K€  lnxn. 

(2)  If  Q  is  Hermitian  then  K  is  Hermitlan. 

(3)  If  Q  is  positive  semidefinite  then  K  is  positive  semidefinite. 

Proof : 

(1)  [17] 

(2)  This  follows  directly  from  (1) . 

(3)  [17].  □ 

Since  the  Lyapunov  equation  assigns  a  unique  solution  K  to  every 
input  Q,  it  can  be  thought  of  as  an  operator  that  maps  from  CnXn  to  Cnxn. 

Define  the  linear  operator  as 

La(K)  -  a\  +  AK  (2.35) 

where  A  is  always  assumed  to  be  stable.  Equation  (2.34)  is  then  equivalent  to 

La(K)  -  -Q.  (2.36) 

This  linear  operator  has  several  very  useful  properties.  First, 

it  follows  from  Theorem  2.10  that  L.(*)  is  one  to  one  and  onto.  Therefore 

A 

an  inverse  Lyapunov  operator  exists  and  is  linear.  It  will  be  denoted  L  (*)_1 

A 

where 

la1(q)  "  "K-  (2.37) 

Second,  since  (Enxn  is  a  Hilbert  space  under  the  inner  product  defined  in 


equation  (2.28),  the  adjoint  operator  of  L.(0  exists. 


Theorem  2.12:  Consider  the  linear  operator  on  the  Hilbert  space  C 
with  the  inner  product  defined  in  equation  (2.28).  Then,  the  adjoint 
operator  L*(*)  is  given  by 

laCO*  -  la<*)  •  (2.38) 

Proof :  From  equations  (2.28)  and  (2.32)  we  have 
<Y,La(K)>  -  trtt^OC)},  Y,K€CnXn. 

By  the  definition  of  L^, 

<Y,L  (K) )  -  tr{YH(KA+AHK)} 

A 

-  trtY11^}  +  tr{YHA\}. 

Using  elementary  trace  properties, 

<Y,La(K)>  «  ttiifopY}  +  tr{KHAY} 

A 

-  tr  {kV}  +  tr{KHAY} 

-  tr{KH(YAH  + AY)} 

-  tr{KHLAH(Y) } 

-  <K,LaH(Y)> 

which  implies  ^A(’)*“  1^H(*)  by  equation  (2.32).  □ 

Another  concept  which  will  be  useful  in  the  next  chapter  is  the 
square  root  of  a  matrix.  A  matrix  M  is  a  square  root  of  a  symmetric  matrix 
Q  whenever 

MTM  -  Q.  (2.39) 

Under  certain  conditions,  the  square  root  M  will  always  exist,  although  it  is 
in  general  nonunique. 


I 
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Theorem  2.13;  Suppose  that  Q  is  a  n*n  real,  symmetric,  positive  semi- 
definite  matrix  with  rank  m.  Then  there  exists  a  mxn  square  root. 

Proof ;  Straightforward.  □ 

Observe  that  if  Q  has  an  mxn  square  root  M,  then  there  will  be  a  JUn  square 
root  for  every  £>m. 

Finally,  the  following  result  pertaining  to  differentiable  matrices 
is  necessary  for  the  derivations  in  the  next  chapter. 

Theorem  2.14:  If  V(t)  is  an  nxn  matrix  whose  entries  are  real  valued, 
differentiable  functions  of  t,  and  the  inverse  V  *(t)  exists  for  all  t,  then 

V-1(t)  -  -V_1(t)V(t)v"1(t) .  (2.40) 

Proof :  The  proof  is  obvious  from  the  fact  that 

V(t)V-1(t)  -  I 

and  the  product  rule  for  differentiation.  □ 


! 

2.4.  The  Differential 

The  concept  of  the  scalar  differential  can  be  extended  to  operators 
and  vector  spaces .  Suppose  that 

(2.41) 


I  where  and  are  Hilbert  spaces.  If  there  is  an  operator  6JI  (x;Ax)  that 

is  linear  and  continuous  with  respect  to  AxSJt,  and  if  6 Jf  (x;  Ax)  satisfies 

11.  -J(»)  -tf  .  „  (2.4 

I 


then  <5Jf(x;Ax)  is  the  Frechet  differential  of  J  at  x  in  the  direction  Ax 


*  V*  .'*■  •  .  *  . 

v.  .....  .  , 

r, 

I*4 
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a 

Notice  that  the  differential  assigns  a  vector  y  to  every  ordered  pair 

(x,Ax) , 

•  •  « 

6^: 

(2.43) 

n 

Since  S*(x;Ax)  is  linear  in  Ax,  it  can  be  written  (for  each 

xejfc) , 

■  •  # , 

S*(x;Ax)  ■•Jf'(x) Ax 

(2.44) 

r- 

where  «*'  is  a  bounded  linear  operator  that  depends  on  x. 

Btz,y) . 

i 

Here,  denotes  the  space  of  all  bounded  linear  operators 

from  A  to  tfr. 

f  ■. 
t--; 

.  '  ■ 
v- 

3?  is  called  the  Frdchet  derivative  of  at  x. 

Many  of  the  properties  of  scalar  derivatives  extend  to 

Frgchet 

p 

derivatives,  such  as  the  familiar  chain  rule. 

Theorem  2.16  (Chain  Rule):  Suppose  that  *  and  Jf  are  operators 

•  '# 

* 

JC :  t 

(2.45) 

where  X,  ty,  y  are  Hilbert  spaces.  Suppose  further  that  the  Frdchet  deriva- 

-* 

tive,*'  exists  for  all  x€A  and  that  Jf'  exists  for  all  y€tf  . 

Then, 

the 

operator 

JC'Jf 

(2.46) 

V 

has  a  Frdchet  derivative  given  by 

h  , 

W"; 

(JO!*)  ’  (x)  -  Jf'  (*(x) )  j!  (x)  . 

(2.47) 

r: 

<  ■ 

Proof:  [16]. 

□ 

P 

-*■*  4’*  «  '*  .*• 

The  chain  rule  also  applies  to  Frdchet  differentials.  It  follows 


from  equation  (2.47)  that  the  differential  of  the  composite  operator  in 
equation  (2.46)  is 

63C»«£(x;Ax)  ■  63C(«£(x)  ;  6<^(x;Ax))  (2.48) 


where  7C  and  <3  are  under  the  same  assumptions  as  in  the  theorem. 

The  geometrical  interpretation  of  the  Frdchet  differential  is 
similar  to  that  of  the  scalar  differential.  Equation  (2.42)  is  equivalent  to 
saying  that  for  all  e>0,  there  exists  6(e)  >0  such  that  if  I  Ax!  <6(e)»  then 


Since 


fl-f(x+Ax)  -^(x) -6«£(x;Ax)l  <  el  Axil  . 

HJr  (x+Ax)  -^(x)8  -  !6«*(x;Ax)8  <  l«*(x+Ax)  -«^(x)  -6«£(x;Ax)0 


we  can  see  that  for  small  I  Ax! ,  the  magnitude  of  the  differential  is  approxi¬ 
mately  equal  to  the  magnitude  of  the  change  of  <^r  (x)  in  the  Ax  direction. 

The  differential,  then,  can  be  interpreted  as  a  sensitivity  function. 

In  the  case  of  functionals,  this  concept  can  be  carried  further. 
First,  recall  that  a  functional  is  an  operator  that  maps  a  vector  space 
into  the  real  line, 

f:jf.-R.  (2.49) 

Theorem  2.15  (Riesz  Frdchet) :  If  g  is  a  bounded  linear  functional  on  a 
Hilbert  space  j(  ,  then  there  exists  a  unique  vector  h  in-4  such  that  for 


where  9gi  is  the  induced  functional  norm, 

9 SD  ~  sup  { jg(x) | }. 

I  xl  -1 

Proof :  [16].  □ 

The  true  power  of  this  theorem  is  that  it  applies  in  infinite 
dimensional  spaces.  In  this  thesis  we  will  be  concerned  primarily  with  finite 
dimensional  Euclidean  vector  space.  However,  the  above  theorem  is  still 
useful  because  it  provides  a  means  of  calculating  the  gradient  through  the 
differential. 

Consider  the  functional  f,  as  in  equation  (2.49),  over  a  Hilbert 

space  Jtp.  Since  the  differential  <5g(x;Ax)  is  linear  in  Ax,  the  Reisz-Frdchet 

theorem  guarantees  that  there  exists  a  vector  7  f  such  that 

X 

<5f(x;Ax)  ■  <7^f  (x)  ,Ax>  (2.50) 

where  7^f(xQ)  denotes  the  gradient  of  f  with  respect  to  x  at  the  point  xq. 
Recall  that  the  gradient  is  a  vector  that  points  in  the  direction  of  maximum 
increase  of  f  at  xq,  and  whose  magnitude  is  equal  to  the  value  of  the 
directional  derivative  in  that  direction. 

Equation  (2.50)  also  provides  another  geometrical  interpretation 
of  the  differential.  When  9Axfl  ■  1,  the  differential  is  just  the  gradient 
projected  in  an  arbitrary  Ax  direction.  In  this  case,  the  differential  is 
equal  to  the  directional  derivative.  This  suggests  a  way  to  calculate  the 
differential. 

Theorem  2.16;  If  the  Frdchet  differential  of  the  functional  in  equation 
(2.49)  exists,  then  it  is  unique  and  is  given  by 


Proof :  [16].  □ 

These  results  have  a  straightforward  generalization  to  Cartesian 
product  Hilbert  space. 

Corollary  2.17;  Suppose  f(x,y)  is  a  functional  over  a  Cartesian  product 
Hilbert  space, 

f:j +  (2.52) 

If  the  Frdchet  differential  of  f  exists,  it  is  given  by 

6f  ((x,y)  ;  (Ax, Ay))  -  <5f(x,y;Ax)  +  5f(x,y;Ay)  (2.53) 

where  each  of  the  differentials  on  the  right  hand  side  of  the  above  equation 
is  computed  according  to  equation  (2.51). 

Proof :  The  proof  follows  from  Theorem  2.16.  □ 

Equation  (2.50)  can  also  be  extended.  The  Riesz-Fr£chet  theorem  Implies 

6f ((x,y) ;Ax,Ay))  -  < (V  f ,V  f) , (Ax, Ay) >  -  <V  f,Ax>  +  <V  f,Ay>  (2.54) 

x  y  x  y 

where  this  last  equality  was  from  equation  (2.31). 

An  application  of  Frdchet  differentials  that  will  be  Important  in 
the  development  of  this  thesis  is  the  sensitivity  of  eigenvalues  and  singular 
values  with  respect  to  a  parameterization  of  a  matrix.  The  following  two 
theorems  provide  formulas  for  these  sensitivities. 

Theorem  2.18:  Consider  the  real  nxn  matrix  A(a)  whose  elements  are  Frdchet 


differentiable  functions  of  the  vector  a 


If  X^(ot)  is  a  distinct  eigenvalue  of  A(ct)  with  right  eigenvector  v^(a)  and 
left  eigenvector  w^(a)  then 

w^(a)5A(a;Aa)v. (a) 

<5X  (a ;Aa)  -  -L— 5 - 1 -  .  (2.56) 

wi(a)vi(a) 

Proof :  [18] .  □ 

Although  A  is  a  matrix,  it  is  not  considered  here  in  the  usual  sense  of  a 
linear  operator.  Each  entry  of  A  is  a  possibly  nonlinear  functional  of  the 
vector  o,  and  hence  A(-)  is  a  nonlinear  function  from  Rn  into  Rn  n. 

Note  that  the  hypothesis  X^  distinct  is  imposed  in  the  above 
theorem  because  for  nondistinct  X^,  the  left  and  right  eigenvectors  can  be 
orthogonal,  that  is,  w^(a)v^(a.)  ■  0. 

An  analogous  theorem  holds  for  singular  values. 

Theorem  2.19:  Consider  the  matrix  A(a)  as  in  equation  (2.55).  If  a^a)  is  a 
distinct  nonzero  singular  value  of  A(a) ,  then 

6ai(a;Aa)  *  Re{u^(a)6A(a;Ao)vi(a) }  (2.57) 

where  u^a)  and  v^(a)  are  the  left  and  right  singular  vectors  that  correspond 
to  a^(a) . 

Proof  :  From  equation  (2.3) 

oj(o)  -  Xi(AH(a)A(cl)).  (2.58) 

Applying  the  chain  rule  (equation  (2.48))  and  equation  (2.46), 

w^(a)6 [AH(a)A(a) ]v  (0) 

2a. (a) 6a. (a; Aa)  - - - -  .  (2.59) 

w^(a)vt(a) 

Here,  w^(a)  and  v^(a)  are  the  left  and  right  eigenvectors  of  the  matrix 
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H  H 

A  (a)A(ci)  .  Since  w^(a)v^(a)  is  in  the  denominator,  we  have 

n 

w^(a)v^(a)  ^  0  *►  cr^(cO  is  distinct. 

H 

Also,  since  A  (a)A(a)  is  a  Heraitian  matrix,  we  know  that 

(1)  wt  -  v± 

(2)  v«  v±  -  1. 

Hence ,  „ 

w“(a)Vi(o)  -  1. 

Applying  the  product  rule  on  the  right  hand  side  of  equation  (2.59) 
<S  [A^(a)  A(a)  ]  «  6A^(a  ;Aa)A(a)  +  A^(a)  6A(a  ;Ao)  . 

Substituting  this  and  equation  (2.60)  into  equation  (2.59), 


(2.60) 


2o^(a)6o^(a;Aa)  *  v^(a) [A^(a) 6A(o ;Aa) + 6A^(a;Aa)A(a))v^.  (2.61) 


From  equation  (2.9), 

20i60i(a;Aa)  ■  a^(a)u^(a)<5A(a ;Aa)v^ (a)  +  v^5AH(a;Aa)ui(a)ai(a) 

where  u^(a)  is  the  left  singular  vector  of  A(a) .  Then, 

20f50i(a;Aa)  •  a^(u^6A(a ;Aa)v^ +  [u^5A(a;Aa)vi]H) 

■  2o^Re{u^5A(a;Aa)v^} . 

Dividing  each  side  by  2a^(ct)  yields  equation  (2.57).  □ 

In  the  nondistinct  case,  the  singular  vectors  will  be  nonunique. 
However,  it  can  be  shown  [19]  that  there  exists  u^  in  the  left  singular 
vector  subspace  and  v^  in  the  right  singular  vector  subspace  such  that 
equation  (2.57)  holds. 


tit: 
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3.  ADJUSTMENT  OF  CONTROL  SYSTEM  PARAMETERS  USING 
SINGULAR  VALUE  SENSITIVITIES 

The  purpose  of  this  section  is  to  develop  a  method  for  modifying 
control  system  designs  to  satisfy  singular  value  inequalities.  First,  the 
translation  of  desirable  closed  loop  MIMO  system  properties  into  requirements 
on  the  singular  values  of  the  return  difference  matrix,  the  loop  transfer  ma¬ 
trix,  and  the  inverse  Nyquist  matrix  is  discussed.  Then,  general  sensitivity 
formulas  for  these  important  functionals  are  derived.  A  control  parameter 
adjustment  method  for  manipulating  the  frequency  shapes  of  these  functionals 
is  proposed.  This  theory  is  then  applied  to  the  linear  quadratic  (LQ)  problem 
to  obtain  specific  results. 

3.1.  Loop  Shaping  in  the  Frequency  Domain  for  MIMD  Systems 

Consider  the  MIMD  system  shown  in  Figure  3.1,  with 

u(jou)  €<C? 

y(j“0>  u  (jw),  d(ju>),  Tl(ju>)  €<&“. 
c 

P(j<u)  is  the  p  input,  m  output  plant,  and  K(jm,or)  is  the  m  input,  p 
output  controller.  The  vector  ot  represents  the  control  parameters,  which 
are  chosen  by  the  designer.  The  vectors  d  and  1]  are  disturbance  and  sensor 
noi3e  signals,  respectively. 

The  transfer  function  matrix  of  this  system  is  given  by 

PK[I  +  PK]‘1(uc  -  Tl)  +  [I  +  PKj"ld 


y  - 


(3.1) 
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where  Che  explicit  dependence  on  u)  and  a  has  been  suppressed  for  notational 
clarity.  Here,  we  see  some  of  the  typical  SISO  design  tradeoffs  appearing 
immediately  in  the  MIMO  case.  For  example,  consider  the  above  equation  with 
t]  «  u^  »  0.  The  effect  of  the  disturbance  on  the  output  is 

y  *  [I  +  PK]_1d. 

Taking  norms  on  each  side  and  using  equations  (2.10)  and  (2.11)  we  obtain  the 
inequality 

II  y  I!  —  a  (  I  +  PK)  II  ^  II’  (3«2) 

For  disturbance  rejection  we  must  therefore  have 

a  (  I  +  PK)  >  1  (3.3) 

at  frequencies  where  disturbances  may  be  large.  This  translates  into  a 
condition  on  the  loop  gains  through  the  singular  value  inequality 

|  a  (  I  +  PK)  -  a  (  PK  )  |  <  1.  (3.4) 

When 

£  (PK)  >  >  1  (3.5) 

condition  (3.3)  is  implied.  This  requirement  of  large  loop  gains  and  large 
return  difference  matrix  for  disturbance  rejection  is  analagous  to  the  scalar 
case. 

A  system  has  good  command  following  properties  if  the  error  between 


the  input  and  the  output  is  small  when  no  sensor  noise  or  disturbance  is 
present.  From  equation  (3.1),  we  see  that  good  command  following  requires 


PK[I  +  PK] 


(3.6) 


in  the  frequency  range  where  the  command  signal  has  significant  energy.  If 
e(ju>)  is  defined 

e(jtu)  -  y(jcu)  -uc(jtu)  (3.7) 

then  the  relation 

e  -  [I  +  PKj_lu  (3.8) 


is  obtained  by  substituting  equation  (3.1)  into  (3.7)  with  T[  «  d  ■  0.  A.s 
before,  condition  (3.3)  implies  that  ||  e  ||  will  be  attenuated  and  that  equa¬ 
tion  (3.6)  will  be  satisfied. 

The  first  limitation  on  making  the  loop  gains  arbitrarily  large  is 
imposed  by  the  sensor  noise  T|.  From  equation  (3.1),  the  noise  to  output 
transfer  function  matrix  is  just  the  negative  of  the  command  to  output  ma¬ 
trix.  Thus,  conditions  (3.5)  and  (3.3)  and  equation  (3.6)  imply  that  the 
sensor  noise  will  be  passed  directly  to  the  output.  From  equation  (3.1), 


I!  y  II  < 


g  (PK  ) 
a  (I  +  PK) 


i!  mi 


(3.9) 


and  so  it  is  desirable  to  have 

g  (  PK  )  <1  (3.10) 

at  frequencies  where  the  command  signal  is  dominated  by  the  sensor  noise. 
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There  are  other  more  severe  restrictions  on  the  loop  gains  imposed  by  robust¬ 
ness  and  sensitivity  considerations. 

A  closed  loop  system  is  said  to  be  robust  when  it  remains  stable  under 

plant  perturbations.  These  perturbations  represent  the  difference  between 

£ 

the  true  plant,  P(juu),  and  the  mathematical  model,  P  (jtu).  As  discussed  in 
Section  2,  this  difference  can  be  modeled  by  additive  or  multiplicative  per¬ 
turbations.  In  the  case  of  additive  perturbations  that  surround  the  plant 
and  controller,  the  sufficient  condition  for  closed  loop  stability  (see  Thm. 
2.4)  is 

2  (I  +p*<j<n)K(Juj,a)  )  >  \(Juj)  V  u>  (3.11) 

where  JL  (jou)  is  the  uncertainty  magnitude  bound.1  For  good  robustness  pro- 

A 

perties,  the  left  hand  side  of  the  above  equation  should  be  made  as  large  as 
possible  through  choice  of  the  parameter  vector  a. 

When  the  difference  between  P(juu)  and  P*(jtu)  is  modeled  by  a  multipli¬ 
cative  perturbation,  the  sufficient  condition  for  stability  (see  Thm.  2.5)  is 

2.(1  +  (P*(j<u)K(ju),a))'1  >  j^(jc»)  V  uj  (3.12) 

If  ^w(j<u)  >>  1  ,  then  by  equation  (3.4) 

M 


^The  uncertainty  magnitude  bound  i^(juu)  is  a  positive  real  valued  function  of 
u).  It  defines  a  class  of  perturbations  AP^  where  each  member  of  the  class 
is  a  mxm  transfer  function  matrix  that  is  stable,  rational,  and  has  MAP.  !|  < 

yj«>.  A  ' 
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2d  +  [P*(juj)K(jaj,a)]"1)  —  2([P*(j^)K(jaj,0')]’^) 


[<7(P*(ju))K(juJ,a))]“1 


(3.13) 


and  this  reduces  to  condition  (3.10). 

Similar  conditions  arise  from  sensitivity  analysis.  It  has  been  shown 
[5]  that 

ec(jtu)  *  [I  +  P(ju>)K(j‘",»)]’1eo(joj)  (3.14) 

where  e  (ju>)  and  e  (jou)  are  defined  in  eqn.  (2.23).  This  again  implies  that 
c  o 

the  return  difference  matrix  (eqn.  3.3)  should  be  large  in  the  frequency  range 
of  interest.  Notice  that  the  true  plant  P(j(u)  appears  in  the  above  equation. 
This  equation  can  be  expressed  in  terms  of  the  nominal  model  and  the  perturba¬ 
tion  magnitude  bound,  as  shown  in  Theorems  2.6  and  2.7.  Here,  we  get  the 
conditions  that 

2(1  +  P*(j«0K(ji»,a))  >  1  +  4A(ju>)  (3.15) 

for  additive  perturbations,  and 

o(I  +  [P*K]-1)  >  lm( ju)  +  o([P*(ju))K(ja),a)]"1)  (3.16) 

for  multiplicative  perturbations. 

In  designing  a  control  system,  and  in  choosing  a,  it  is  necessary  to 
know  where  in  the  frequency  spectrum  the  uncertainty  bounds  are  large.  In 
general,  the  multiplicative  uncertainty  bound  X^(ju>)  increases  as  frequency 
increases.  This  places  a  limit  on  the  loop  bandwith.  The  crossover  fre¬ 
quency,  defined  as 

u>c:c  (  P*(ju>c)K(ju>c,a))  -  1 


(3.17) 


I 


must  occur  approximately  where  becomes  greater  than  one.  Otherwise, 

Theorem  2.5  will  be  violated.  Typically  additive  perturbations  dominate  at 


f.: 


r. 


- 

r. 

K- 

I 

r. 

s': 

t: 


lower  frequencies,  and  so  eqn.  (3.11)  should  be  satisfied  for  tu  <  These 

requirements,  in  terms  of  loop  gains,  are  shown  in  Figure  3.2.  This  is  the 
MIMO  generalization  of  classical  SISO  loop  requirements. 

The  task  of  the  designer,  then,  is  to  manipulate  the  singular  value 
functionals  in  equations  (3.11)  and  (3.12)  through  choice  of  the  parameter 
vector  a.  In  this  thesis,  a  method  of  choosing  a  to  increase  (or  decrease) 
these  quantities  in  a  gradient  optimal  sense  is  proposed. 

3.2.  General  Sensitivity  Formulas 

The  design  of  any  controller  usually  requires  several  iterations. 

An  initial  parameter  vector  arQ  is  selected  (see  Figure  3.1)  and  then  the 
closed  loop  system  is  valuated  to  see  if  it  meets  the  design  objectives  and 
constraints.  If  not,  a  new  vector  must  be  determined  on  the  basis  of  the 
evaluation. 

As  previously  discussed,  the  design  objectives  translate  into  upper  and 
lower  bounds  in  specified  frequency  ranges  on  the  functionals  defined  below. 
In  these  equations,  the  frequency  variable  has  been  suppresed  for  notational 
brevity. 

2(1+  P*K(a  )  )  A  c  (a  )  (3.18) 

A  y  U  ^ 

a(l+  K(aQ)P*  )  A  2La>i(V  (3-19) 

2  (  I  +  [P*K(aro)fl)  A  2Mf0(Qfo>  (3.20) 


Figure  3.2.  Multivariable  loop. 
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£(I+  [K<«  )P*]“l)  t  a  (pt0) 

u  rl,  I 


(3.21) 


The  next  design  vector  must  then  Increase  (or  decrease)  the  value  of 
<j  (aQ)  In  that  frequency  range.  Here,  a  (or  )  refers  to  any  of  the  above 
functionals.  (This  discussion  applies  as  well  to  any  singular  value  ffx  Y(®0) 


of  interest.) 


The  gradient  vector  (a  )  can  be  used  to  increase  (or  decrease) 

£  (or  )  in  an  optimal  sense.  Since  the  gradient  vector  points  in  the  direc- 
X  9  Y 

tion  of  maximum  increase,  there  exists  an  such  that  if  V£x  Y(ar0)  J4  0  then 


V2X  Y(«0> 

+  '  IK.Vvll  *  -  2x.y(“o  +  8 


(3.22) 


for  every  vector  u  €  ft  .  The  vector  is  then  defined 


“i  ^  •„  * 


(3.23) 


Since  must  be  a  vector  of  constants,  a  fixed  value  of  u>  must  be  chosen  for 

the  calculation  of  the  gradient.  The  vector  is  then  guaranteed  only  to 

increase  the  value  of  o  v(a  )  In  some  neighborhood  of  u  ■  ui. .  To  increase 

“  X, I  o  l 

a  (a  )  in  a  specified  frequency  range,  several  design  iterations  may  be 

"™  A  9  X  O 

necessary.  In  this  case,  a  design  iteration  consists  of  the  following  pro¬ 
cedure. 

(1)  Calculate  vct  x  ^(0?^)  at  a  constant  «  ■ 

(2)  f»r.Vl..ktww(»k) 

(3)  Evaluate  £x  Y  (or^^)  48  4  *unction  frequency  for  different 
values  of  c.  Select  e  ■ 


(4)  If  a  (a  )  does  not  meet  the  overall  frequency  criterion, 

A)  l  Kt  1 

return  to  step  (I). 

In  each  iteration,  it  may  be  necessary  to  calculate  the  gradient  at 
a  different  frequency  than  the  previous  iteration.  Also,  it  may  be  neces¬ 
sary  to  calculate  gradients  with  respect  to  different  functionals  (X»M,A, 
Y-1,0). 

The  gradient  vector  can  be  obtained  by  considering 

a  :  ftn  -  R+  c  R  (3.24) 

where  u>  is  held  constant.  With  the  selection  of  the  appropriate  inner  pro¬ 
duct,  o(Q')  defines  a  map  between  n  dimensional  Hilbert-  space  and  the  positive 
real  line.  When  cty  v(Qf)  is  distinct,  the  Frechet  differential  can  be  com- 
puted.  By  the  Ries  z-Fr^chet  theorem,  the  differential  can  be  expressed  as 
an  inner  product.  If  the  arbitrary  vector  Aa  can  be  isolated  in  the  inner 
product,  the  other  term  must  be  the  gradient. 

6-X,Y(<M°0  -  <  V£x>yW,  4a  >  (3.25) 

This  method  will  be  used  to  calculate  the  gradient  of  the  four  singular  value 
functionals  in  equations  (3.18)  -  (3.21). 

Applying  the  singular  value  differential  formula  C2.57)  to  equation 

(3.18), 

5  c  («;to)  -  Re  [uH  5K(a;4a)  P*v  }  .  (3.26) 

A  )U 

u  and  v  are  the  left  and  right  singular  vectors  of  the  matrix  I  +  P*K(a) 
that  correspond  to  the  smallest  singular  value.  In  the  following  equations, 
the  symbols  u  and  v  will  be  used  generically  to  denote  left  and  right  sin- 
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gular  vectors  corresponding  to  the  appropriate  matrix.  Next,  equation  (3.19) 
yields 

T(®;Aar)  “  Re  {  8K(or  ;Aar)P*v  }  .  (3.27) 

Similar  formulas  are  available  in  the  case  of  multiplicative  perturba- 

y 

tions.  First,  to  avoid  long  equations  define  the  loop  transfer  matrices 

t 

^(a)  A  P*K(Qf),  Tx(a)  A  K(a)P*.  (3.28) 

Then  applying  equation  (2.57)  to  (3.20), 

6-M,0(Qf;Aa°  "  R®  C  uH6QrC[P*K]  "1}v}  (3.29) 

where  6a(.)  denotes  the  differential  with  respect  to  or  in  the  direction  Ao\ 
Employing  equation  (2.40), 

6£M,0(af;Aa°  *  ”Re  i  uV1P*5K(Q';Aq')t"1v}.  (3.30) 

Similarly, 

5£M  i(a;Aa)  "  -Re  {  uHTx15K(a;Aa)P*Tx1v}.  (3.31) 

To  isolate  the  Aar  term  in  the  above  equations,  the  structure  of  the 
controller,  the  choice  of  inner  product,  and  the  particular  space  to  which 
or  belongs  must  be  determined.  This  will  depend  on  the  particular  problem 
under  consideration. 

3.3.  The  Linear  Quadratic  Problem 

The  linear  quadratic  (LQ)  control  problem  expresses  the  control 
objectives  in  terms  of  a  performance  index.  The  objective  of  the  problem 


To  make  this  feedback  system  realistically  feasible,  a  command  input 
is  usually  added  to  the  control  u.  Equation  (3.35)  then  becomes 

u  “  -Gx  +  u 

c 

The  system  of  equation  (3.33)  with  the  control  given  above  can  be 
expressed  in  the  frequency  domain  by  transfer  function  matrices.  The  nominal 
plant  transfer  function  matrix  is  given  by 

P*(jaj)  -  <p(j«u)B  (3.37) 

where 

cp( JUJ)  A  (JOBI  -  A)"1. 

The  loop  transfer  matrix,  calculated  by  opening  the  loop  at  the  input  of  the 
plant  is 

Tx (Q,R)  -  G«pB  (3.38) 

with 

m  /  a  n  v  /■  ^  m  x  in 

Tj  (Q ,  R)  €  <& 

The  dependence  of  T^  on  the  Q  and  R  matrices  is  considered  explicitly  here. 
Also,  the  dependence  on  <u  is  suppressed  for  notational  brevity.  The  output 
loop  transfer  matrix  is 


where 


T0(Q,R)  -  <pBG 

Tq(Q,R)  €  Cnxn  . 


(3.39) 


The  closed  loop  transfer  matrix  is  given  by  the  following  input-output 
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equation, 

y  *  cpB( I  +  GcpB]  ^uc.  (3.40) 

The  controller  is  simply  the  constant  gain  matrix  G. 

K(jou,a)  »  G(Q,R).  (3.41) 

Here,  the  control  parameter  a  represents  the  (Q,R)  matrix  pair,  since  these 
matrices  are  the  choice  of  the  designer.  The  LQ  system  is  shown  in  Figure  3.3. 

Although  G(Q,R)  appears  in  the  feedback  loop  instead  of  the  feedfoward 
loop,  the  preceding  concepts  and  theorems  are  still  valid.  It  is  still  desir¬ 
able  to  make 

2(1  +  ^(Q.R))  (3.42) 

as  large  as  possible  below  crossover,  and 

£(I  +  Tj.  (Q,R)"1)  (3.43) 

as  large  as  possible  above  crossover. 

It  turns  out  theoretically  that  the  full  state  feedback  LQ  loop  has 
very  good  performance  and  additive  robustness  properties.  Starting  with  the 
ARE,  it  is  possible  to  derive  the  following  relation  [20]  provided  that  the 
open  loop  system  has  no  poles  that  lie  on  the  juu  axis. 

a  (I  +  T  (Q,R))  >  Y  u>  ;  V  admissible  (Q,R) .  (3.44) 

R  X  min 

The  R  subscript  means  that  the  singular  value  has  been  weighted  by  R. 


(3.45) 
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In  classical  terms,  this  translates  into  +®  ,-6dB  gain  margin,  and  +60° 
phase  margin  independantly  in  each  input  channel  when  R  is  a  diagonal 
matrix  [4] . 

Although  these  are  impressive  theoretical  guarantees,  in  practice 
there  are  problems.  The  fact  that  every  LQ  loop  has  a  20dB/decode  rolloff, 
contradicts  the  physical  reality  that  almost  every  real  system  has  a  trans¬ 
fer  function  with  two  or  more  poles  than  zeros  [2],  Inevitably,  there  will 
be  unmodeled  dynamics  in  the  nominal  LQ  system.  Hence,  the  designer  may 
find  it  desirable  to  manipulate  equations  (3.42)  and  (3.43)  to  obtain  even 

better  properties  than  equation  (3.44).  For  example,  a  >  lOdB  may  be 

A,  I 

a  necessary  requirement  in  a  certain  frequency  range.  The  iterative  design 
method  of  section  3.2  provides  a  way  to  tune  up  the  Q  and  R  matrices  to 
achieve  the  desired  frequency  shapes  of  the  system  singular  values. 

An  immediate  obstacle  in  applying  this  design  technique  to  the  LQ 
problem  arises  because  (Q^,R^)  must  satisfy  the  positive  definitions  re¬ 
quirements  (3.34).  In  equation  (3.23),  the  parameter  vector  ar^  represents 
(Qi»Ri).  For  example. 


Q,  -  Q  +  eVCT  (Q  ,R  ). 
i  0  ~ 'X,Y  o  o 


The  matrix  Q^  will  not  necessarily  be  positive  semi  definite.  This  problem 
can  be  circumvented  by  calculating  the  gradient  with  respect  to  the  square 
roots  f  (Q  ,R  ),  where 
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Then,  the  (Q^.R^)  pair  is  formed 


(3.47) 


R, 


e2  N  “  i,  j ' 


Note  that  only  positive  semidefiniteness  is  guaranteed,  and  R^  will  have  to 
be  separately  tested  for  positive  definiteness. 

Before  using  the  sensitivity  formulas  (3.26)  -  (3.31)  to  calculate 
the  gradient  vector,  a  Hilbert  space  and  inner  product  must  be  defined.  Here, 
the  parameter  vector  or  represents  the  (M,N)  matrix  pair.  The  singular  value 
functional  £  V(M,N)  is  defined  over  the  cross  product  Hilbert  space  Rpxax 

X,  I 


-X,Y: 


Bp  xn 
R  x 


R 


mxm 


-  R  c  R. 


(3.48) 


The  inner  product  is  defined  as 


((M^),  (M2,N2)>4  trCM^}  +tr{Njj1}.  (3.49) 

This  inner  product  is  the  matrix  analog  of  the  Euclidean  inner  product  on  Rn. 
We  are  now  ready  to  derive  the  gradient  of  the  functionals  a  ( a  ) 

“  Aj  I  O 

for  the  LQ  controller.  First,  consider  the  case  of  additive  perturbations: 


Theorem  3.1  (Input  Perturbations) 

a)  The  gradient  of  a  (M,N)  with  respect  to  M  is 

A ,  I 


7  cr 
M-A,I 


(3.50) 


where  P.  is  the  solution  of  the  Lyapunov  Equation 
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PL  rA  T  +  APl  x  -  Re  { IL  j.  +  E*  z }  -  0 

(3.51) 

and  where: 

„  _  H  -IT 

^1  I  "  'PBvu  R  B 

(3.52) 

A  -  A  -  BG 

(3.53) 

v,u  *  right  and  left  (respectively)  singular  vectors  of  I  + 

Tj (M,N)  corresponding  to  the  smallest  singular  value. 

b)  The  gradient  of  a  (M,N)  with  respect  to  N  is 

A,  I 

T 

V  <J  ■  -NP  -  NGP„  G 

N-A.I  2,1  1,1 

(3.54) 

where: 

?2  i  *  Re  j  “b  ^2  I  ^ 

(3.55) 

r  »  t  vuhr  1. 

Z  9 1  X 

(3.56) 

To  simplify  the  notation,  the  explicit  dependence  of  the  Fr^chet  dif¬ 
ferentials  on  the  variables  M,N,AM,  and  AN  will  be  dropped  unless  needed  for 

clarity. 

Proof:  From  equation  (3.27),  we  have 

S(M,N)  -A, I  *  R*tuHs(M,tl)apBv1' 

(3.57) 

The  differential  of  G  over  the  cross  product  space  is  equal 

the  differentials  over  the  individual  spaces; 

to  the  sum  of 

5(M,N)G  "  6mg  +  5ng* 

(3.58) 
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First, 


V  •  r'1bV 


(3.59) 


where  K  depends  on  M  through  the  Reccati  equation  (3.36).  Employing  equation 
(2.51)  to  differentiate  the  Reccati  equation  with  respect  to  M  yields 

^  +  *T«m  +  f^(M  +  €AM)T<M  +  eAM>  le-o 

-  fi^KBR’^K  -  KBR"IBT6mK  -  0. 


Combining  terms. 


6mK(A  -  BG)  +  (A  -  BG)T&mK  +  MTAM  +  dMTM  •  0. 


(3.60) 


This  can  be  expressed  in  terms  of  the  Lyapunov  operator  defined  in  equation 
(2.35)  and  the  closed  loop  A  matrix  defined  in  (3.53) 


^(6^)  -  -(MTAM  +  AM^M) . 
A 


(3.61) 


Since  this  operator  is  invertable  (see  equation  2.37)  the  above  equation  can 


be  solved  for  6WK 


V  ■  -L^’l(MTAM  +  AMTM). 


(3.62) 


Substituting  this  into  (3.59) 


6mg  -  -R_1BTL  j-1(MTAM  +  AMTM) . 


(3.63) 


The  differential  5^G  can  also  be  found, 
6nG  -  +  6n(R_1)BTK. 


(3.64) 


Using  equation  (2.40), 


VR'l> 


-r'^rr’1 


^-r’1(N  +  e4N)T(N  +  e4N)R‘1  | 


be 


-R_l(NTAN  +  ANTN)R-1. 


e-0 


(3.65) 


The  differential  5^K  can  be  found  by  differentiating  the  Riccati  equation 
with  respect  to  N, 

SKjjA  +  AT5nK  -  SjjKBR'V^  -  KB6n(R-1)BTK  -  KBR_1BT6NK  -  0. 
Substituting  (3.65)  into  the  above  and  combining  terms  gives 


6mKA  +  A6„K  +  GT(NTAN  +  ANTN)G  -  0. 

N  N 


In  terms  of  Lj  we  have 


la(6nk)  *  -V-  +  ^NTN)G 


5nK  -  -Lj’1  (GT(NTAN  +  ANTN)G), 


(3.66) 


Substituting  (3.66)  and  (3,65)  into  (3.64)  yields 


6ng  -  -r’1btl^‘1(gt(ntan  +  ANTN)G) 
-r’l(ntan  +  ANTN)G. 


(3.67) 


Substituting  equations  (3.67)  and  (3.63)  into  (3.58), 
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tr{ANTNRe  C Z2,I  +  E2,I^  * 
tr{4NTNP2>I} 


(3.72) 


where  P  is  defined  in  (3.55). 

2  I  A* 


The  second  term  of  (3.70)  can  be  manipulated  in  a  similar  fashion. 


We  have 


Second  term  -  Re(tr{uHR-1BTLT_1(GT(NT4N  +  ANTN)G)<pBv}  } 


Re{tr{L£l(G(NTAN  +  ANTN)G)«pBvuHR_1BT}  }. 


(3.73) 


The  AN  matrices  can  be  exposed  by  moving  the  Lyapunov  operator  L j  on  to  the 
following  terms  through  the  adjoint  operator  LgX  (see  equation  (2.38)).  The 
above  equation  is  then 

Second  term  -  Re{tr(GT(NTAN  +  ANXN)GLffT  (cfBvu1^"1*1)  }  }. 

Breaking  this  up. 


Second  term  -  Re { t r (G^ttXAN GL  jT  (Z^  j.)} 


(3.74) 


+  tr{GXANXNGL_T  (Z.  .)}  } 
A  *»■*■ 


where  I  is  defined  in  (3.52).  Using  elementary  trace  properties, 

Re{tr{GL_T  (Z.  _ ) GTNTAN }  +  tr {AN XNGL  —  t ^ (2.  t)G}} 

A  A 


-  Re{tr{ANTNGlfc_T1(Zj^I  +  Z^G1}  ) 

-  tr(ANTNGL_f1(ReiZ1>I  +  ^1#I})GT}  } 


(3.75) 
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where  Che  linear  character  of  L-.x  has  been  exploited.  According  to  definl- 

A 

tion  (3.51),  this  can  be  written  as 


-  tr^N^GPj^  jG1}}. 


(3.76) 


The  last  term  of  (3.70)  can  be  manipulated  similarly, 

Third  term  -  {tr^u^'S1  L-’1(MTAM  +  AMTM)<pBv}} 

A 

-  Re{tr{L£L(MTAM  +  AM^cpBvuW}} 

-  Re{tr{(MTAM  +  4MTM)  L_T  (S.  _)}. 

A  1»-L 


(3.77) 


Further  juggling  as  in  equations  (3.74)  -  (3.75)  yields 


Third  term  -  tr{AMTM  L_^1(Re{Z.  +  _})} 


tr{4MTMP  }. 


(3.78) 


Substituting  equations  (3.72),  (3.76)  and  (3.78)  into  (3.70),  we  have 


5(M,B)2a  ’  -«UKTKP2>i}  - 


In  terms  of  the  respective  inner  products, 


5(M.N)2A  ■  •  -  <^.NP2jI)  -  <*M.MPlir> 

-  (AN.-NGP^G1  -  NP2  j)  +  (AM.-MPj.  x>. 


The  right  hand  entries  of  these  inner  products  are  the  gradient  matrices.  □ 
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P: 


« 


k 


y*m-. 


* 


i 


K 


Theorem  3.2  (Output  Perturbations) 

a)  The  gradient  of  a.  _(M,N)  with  respect  to  M  is 

A  )U 


7m2a,0(m'n>  ■  -”1.0 


where  Q  is  the  solution  of  the  Lyapunov  equation 


pi  0*T  +  AP10  “  o+sio1>0 

and  where:  2. 


%0 


H  -IT 
vucpBR  B 


u,  v  are  the  left  and  right  singular  vectors  corresponding  to  the 

singular  value  of  the  matrix  I  +  T  . 

0 

b)  The  gradient  of  o  (M,N)  with  respect  to  N  is 

™  A  jU 

7n2a.0<M-S>  •  -  ”2.0  -  NGP1,OoT 


where: 


P„  ~  -  ReiX  „  +  e:  J 


'2,0 


2,0  2,0' 


_  _  H  -1 

S2,0  “  GVU  CPBR  ’ 


Proof:  From  equation  (3.26),  the  differential  is 


4(M.N)2a,0  •  ’tetuH7B4(M.N)Gv}- 


The  differential  of  G  was  previously  calculated  in  equation  (3.68) 
tutlng  this  into  the  above,  and  taking  the  trace. 


6/vl  „.cr  .  -Re{tr{uHcpBfR*^(NTdN  +  ANTN)G 
A,0 


(3.79) 

(3.80) 

(3.81) 
smallest 

(3.82) 

(3.83) 

(3.84) 

.  Substl- 


(3.85) 


+  R  B  Lj  £gX(N  AN  +  AN  N)G} 

+  r_1btl  £  1[MTAM  +  AM^kjIv}}. 

Arranging  the  first  term  as  In  equation  (3.71), 

First  term  -  Re{tr[uHcpBR’1(NTAN  +  ANTN)Gv}} 

-  Re{tr{uHcpBR-1NTANGv}  +  tr^epBR^AN^Gv}). 

Using  the  coranunitive  and  transpose  properties  of  the  trace  and  definitions 
(3.83)  and  (3.84), 

First  term  -  Re(tr[GvuHcpBR-1NTAN  }  +  tr {AN^NGvu**epBR~ ^ } } 

-  Re{trt22>0NTAN}  +  tr(ANTNT2>0 }} 

-  tr{ANTNP2  0}.  (3.86) 

The  second  term  of  (3.85)  becomes 
Second  term  -  Re[tr[uH<pBR_1BTL  jl  £gT(NTAN  +  ANTN)g}v}} 

-  Re{tr{L  j1  £gT(NTAN  +  ANTN)G  Jvu^BR^B1}} 

-  ReCtrlLy1  {gT(NTAN  +  ANTN)G}Z]L  q}}.  (3.87) 

Employing  the  adjoint  operator  and  proceeding  as  in  equation  (3.74)  through 
equation  (3.77), 

Second  term  ■  tr{ANTNGP,  GT}. 


(3.88) 


The  third  term  in  equation  (3.85)  can  be  manipulated  in  a  similar  way, 


Third  term  ■  Re{tr{u^cpBR  ^  {M^AM  +  AH^M}v} 

-  Re{tr[L  j1  {MTAM  +  AM^Jvu^pBR-^1}} 

-  Re[tr(L^l{MTAM  +  AMTm}  q}}.  (3.89) 

And  proceeding  as  in  equations  (3.77)  -  (3.78), 

Third  term  ■  tr{AM^MP..  }.  (3.90) 

i  ,0 

Substituting  equations  (3.90),  (3.88)  and  (3.86)  back  into  (3.85), 

5(M,N)Sa,0  ■  -  «t"<T“Pli0cT} 

-  <AM,-MPlj0>  +  <AN,-NP2j0-NGP1>0GT). 

This  completes  the  proof.  □ 

The  case  of  multiplicative  perturbations  yields  similar  results. 
Theorem  3.3  (Input  Perturbation) 

Suppose  that  the  matrix  B  (in  equation  3.33)  has  full  column  rank,  and 
that  the  pair  (A,M)  is  observable. 

a)  The  gradient  of  au  ,(M,N)  with  respect  to  M  is 

7M-M,I  ■  MP3,I  (3,91) 

where  P 


is  the  solution  of 
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and  where 


_  .  -1  H  -1  -  1_T 

2^  j  ^  cpBT^  vu  Ti  R  B  . 


u  and  v  are  the  left  and  right  singular  vectors  of  1 
ing  to  the  smallest  singular  value. 


(3.93) 


T~^(M,N)  correspond- 


b)  The  gradient  of  or  _(M,N)  with  respect  to  N 

—  M,l 


is 


7N2m,i  •  NP4.I  +  ngp3.ig 


(3.94) 


where 


and, 


P4,I4R'lE4,I  +  Z4,I} 


E4,i  2 


(3.95) 

(3.96) 


Proof;  From  equation  (3.31) 


6(M,N)2M,I  *  "Re^-U  TI  6(M,N)G5f5BTI  V^' 


(3.97) 


The  assumption  that  B  has  full  rank  and  that  (A,M)  is  observable  is  needed 

to  guarantee  the  existance  of  the  inverse  of  T^  *  GcpB. 

The  differential  „.G  was  previously  computed  in  equation  (3.68). 

(M,N) 

Substituting  this  in, 

\m,N)2m,I  •  {MT4M  +  4MTM) 

+  r^b1^1  [gT(NTAN  +  ANTN)g} 

+  R-1(NTdN  +  ANTN)G]cpBT~^v}. 


Taking  the  trace  and  rearranging, 


5(m  n)£m  j  -  ReUrCA'V^N^N  +  ANTN)Gc{aT"lv) 

+  tr{uHT‘1R’1BTL^L{GT(NT6N  +  ANTN)G}cpBT*Lv} 

+  tr{uHT"1R_1BTI^1{MTAM  +  ^MjcpBT^v} } .  (3.98) 

The  first  term  of  the  above  can  be  manipulated  as  in  equations  (3.71) 
(3.72)  to  yield 

Re{tr{uHT^1R'1(NTM  +  ANTN)v}}  -  tr(ANTP4  I).  (3.99) 

Following  equations  (3. 7A)-(3. 76) ,  the  second  term  of  (3.98)  becomes 
Re{tr[uHT‘1R'1BTL^1(G(NTAN  +  AN^GlcpBT^v}} 

-  tr{ANTNGP3  /}.  (3.100) 

Following  equations  (3. 77)-(3. 78)  the  last  term  of  (3.98)  is  written 
Re{tr{uHTj1R"1BTL^1{MTAM  +  WttjlfcpBTjM} 

-  tr{dMTMP3  j}.  (3.101) 

Substituting  equations  (3.99)-(3.101)  into  equation  (3.98)  yields 

!(m,»)2m,i  •  «t“TMr3iI)  +  trC®T(»P4iI  +ngp3jIg)} 

-  <AM,MP3  r>  +  <AN,NP4  r  +  NGP3  xGT). 

This  completes  the  proof.  □ 

Theorem  3.4  (Output  Perturbations) 

Suppose  that  the  pair  (A,M)  is  observable,  and  that  the  matrix  B  has 
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rank  n.^ 

a)  The  gradient  of  £M  Q  (M,N)  with  respect  to  M  is 


V  a  =>  MP„ 

M-M,0  3,0 

(3.102) 

where  P. 

is  the  solution  of  the  Lyapunov  equation 

P,  5T  +  IP-  -  Re{2_  +  }  -  0 

j  jO  v  )U  j  jU  w  jU 

(3.103) 

and  where: 

-1  H  -1  -1  T 

Z,  A  T  vu  T  ^BR  B 

3,0  -  0  0  Y 

(3.104) 

u,  v  are  the  left  and  right  singular  vectors  of  the  matrix  I  +  T^CM.N)  corre¬ 
sponding  to  the  smallest  singular  value. 

b)  The  gradient  of  a  (M,N)  with  respect  to  N  is 

— *  M,0 


7n*m,o  ■  np4,0  +  bgp3,ogI 

(3.105) 

where 

l4,0  4  + 

(3.106) 

and 

\o 4 

(3.107) 

Proof: 

From  equation  (3.30), 

5(M,N)2m,0  ■  -Ret“\l’Bi<M>N)Gllv} 

(3.108) 

Notice  that  B  and  G  are  of  rank  n,  thus  the  inverse  of  T  *  cpBG  exists.  Sub- 

0 

stltuting  equation  (3.68)  into  the  above  and  taking  the  trace, 

2 

This  assumption  is  unrealistic.  See  Chapter  5  for  further  remarks. 


6(M,N)  -m,o 


Re{tr {u  Tq  cpB[R  (N  AN  +  AN  N)G 


+  r"1btl£1[gt(ntan  +  ANTN)G} 

+  R^bV  {MTAM  +  AMTM  ]  ]  TQ‘  Lv  }  }  . 


(3.109) 


(3.72) 


The  first  term  of  the  above  can  be  rearranged  as  in  equations  (3.71)- 


Re{tr{uIi^'1cpBR"1(NTAN  +  ANTN)GT0’M}  -  tr{ANTNP4  Q } .  (3.110) 

The  second  term  of  equation  (3.109)  can  be  rearranged  as  in  equations 


(3. 74)-(3. 76) 


Re[tr{uH^"LcfBR'1BTL^1i;GT(NTAN  +  ANTN)g}h^"1v}  } 
-  tr{ANTNGP,  _GTj. 


(3.111) 


Following  equations  (3 . 77) - (3 . 78)  the  last  term  of  (3.109)  becomes 
Re { t r  1  uHT^"  1cpBR " XB TL^ 1  [MT AM  +  AMTm}HT_1v}  »  tr[AMTMP3  Q  }.  (3.112) 

Substituting  equations  (3. 110)- (3. 112)  into  equation  (3.109)  gives 


us  the  final  form, 


6 a 


(M,N)  -M,0 


<AM,MP,  _>  +  <AN,NP,  A  +  NGP,  rtGT> 


which  proves  the  theorem.  □ 

In  many  LQ  designs  an  observer  must  be  used  to  get  full  state  feed¬ 
back.  It  has  been  shown  that  the  inclusion  of  an  observer  (or  any  dynamic 
element)  in  the  LQ  loop  negates  the  robustness  guarantees  implied  by  equa¬ 
tion  (3. 44), [21],  Hence  the  need  to  manipulate  the  system  singular  values 


to  obtain  increased  robustness  and  sensitivity  margins  is  critical  in  the 
output  feedback  case. 

The  calculation  of  the  gradient  vector  is  complicated  slightly  by  the 
inclusion  of  an  observer  in  the  loop.  The  observer  transfer  function  matrix 
also  depends  on  the  feedback  gain  matrix  G,  and  hence  on  the  (M,N)  matrix  pair. 
The  product  rule  for  differentiation  must  then  be  used  in  equations  (3.26)  - 
(3.31).  Following  this  line  of  reasoning,  it  is  a  straightfoward  exercise 
to  extend  the  above  theorems  to  the  output  feedback  problem. 


4 .  APPLICATION 


In  this  section  the  gradient  design  method  will  be  applied  to  a 
4th  order  nominal  linear  quadratic  regulator  to  improve  the  low  frequency 
properties.  An  alternative  asymptotic  design  method  which  has  been  success¬ 
fully  used  [22]  will  also  be  applied.  The  two  designs  will  be  compared,  and 
certain  important  advantages  of  the  gradient  design  will  be  discussed.  The 
nominal  plant  consists  of  the  standard  linear  time  invariant  differential 
equations  in  state  variable  form 

x  ■  Ax  +  Bu,  xERn,  uE  Rm.  (4.1) 

The  feedback  control  which  minimizes  the  cost  functional 

00 

J  ■  /  x^Qx  4-  u^Ru  dt  (4.2) 

0 

is  given  by 

u  *  -Gx  (4.3) 

where 

-1  T 

G  -  R  B  K. 

K  is  the  solution  of  the  A.R.E., 

KA  +  ATK  +  Q  -  KBR“1BTK  -  0  (4.4) 

where  it  is  assumed  that  (A,B)  is  stabilizable  and  (i/q,A)  is  detectable. 

Q  and  R  satisfy  the  usual  positive  definiteness  requirements,  and  otherwise 
are  arbitrary. 

To  illustrate  the  design  procedure,  values  for  A,  B,  Q,  and  R  were 
selected  as  shown  below 
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■ 

I 

H 

(4.5] 


Q 


o 


R  - 

o 


(4.6] 


The  subscript  above  indicates  initial  values  for  the  weighting  matrices. 
The  nominal  plant  is  stable  with  eigenvalues  at 


A!--l 

X2--2 
X3,4  *  -3±j4. 

The  closed  loop  system  has  eigenvalues  at 


(4.7) 


with 


Xx  2  -  -8.01  +  J8.24 
A3  4  -  -1.16  + j  .408 


G 

o 


2.89  4.13  2.13  -0.161 
2.38  4.45  1.59  -0.128 


(4.8) 


(4.9) 


As  discussed  in  Section  3.1,  the  singular  values  of  the  return 
difference  matrix,  the  loop  matrix,  and  the  inverse  Nyquist  matrix  provide 
a  measure  of  the  various  closed  loop  system  properties  (robustness,  sensi¬ 
tivity,  disturbance  rejection,  etc.).  These  quantities  are  plotted  in 
Figures  4.1,  4.2,  and  4.3  respectively,  for  this  nominal  system.  For  con¬ 


venience,  the  notation 


SINGULAR  VALUE  PLOT 


Singular  values  of  the  return  difference  matrix  for  the  nominal 
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Figure  4.2.  Singula 


SINGULAR  VALUE  PLOT 


Singular  values  of  the  inverse  Nyquist  matrix  for  the  nominal  system 


(4.10) 
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a(I  +  G^B)  -  a 

•¥)4'l,I,l  <4-U) 

a(I+  [G^B]"1)  -  a  (4.12) 

will  be  used.  The  1  subscript  refers  to  the  Independent  variables  (Q^.R^). 

In  this  thesis,  the  bandwidth  (u^)  of  a  MIMO  system  will  be  defined 
to  be  the  frequency  at  which  the  largest  singular  value  of  the  transfer 
function  matrix  drops  to  1//2  of  its  zero  frequency  value.  Recalling  that 

3([I  +  Gi<pB]”1G1  B)  -  [o(I+  [G^B]"1)]"1  (4.13) 

the  closed  loop  bandwidth  can  L  found  from  the  plot  of  a„  _  , .  From 

-M,I,i 

Figure  4.3,  the  closed  loop  bandwidth  is  to  —  21  rad/sec.  Notice  that  here 

O 

we  are  considering  the  input-output  response  between  U£  and  y,  as  shown  in 
Figure  4.4. 

The  crossover  frequency  (u>  )  is  defined  to  be  the  frequency  at 

c 

which  the  largest  singular  value  of  the  loop  transfer  matrix  has  magnitude 

one.  From  Figure  4.2  w  ■  11  rad/sec. 

c 

In  most  LQ  designs,  the  (Q,R)  matrices  are  fine  tuned  by  an  itera¬ 
tive  trial  and  error  method.  This  is  because  there  are  usually  several 
different  design  objectives  and  many  constraints.  Since  there  is  no  direct 
relationship  between  (Q,R)  and  these  objectives  and  constraints,  a  series  of 
(Qi,Rl)  matrices  must  be  chosen,  and  the  closed  loop  system  tested  at  each 
iteration.  The  information  obtained  at  the  ith  analysis  is  used  to  make  the 
next  adjustment  on  (Q^,R^).  In  the  simple  example  presented  here,  only  one 


iteration  is  necessary  to  expose  the  characteristics  of  the  design  method 
developed  in  Section  3. 


For  purposes  of  Illustration,  the  design  objective  in  this  example 


will  be  simply  to  increase  the  value  of  a  _  n  at  frequencies  w <  .1.  This 
will  reduce  the  effect  of  random  disturbances  in  this  frequency  range.  It 
will  also  make  the  system  more  robust  and  less  sensitive  to  additive  modeling 
errors  (in  the  same  frequency  range)  as  shown  in  Figure  4.4.  Notice  that  the 
perturbation  surrounds  the  plant  and  controller.  In  terms  of  Theorem  2.4,  the 
nominal  plant  includes  the  controller, 

P*(jw)  -  G^B.  (4.14) 

In  this  case,  the  value  of  ^  ^  is  a  nonconserva-.ive  bound  for  robustness. 

In  other  words,  there  exists  a  perturbation  as  shown  in  Figure  4.4  with 


"^A1  *  2a, 1,0 


that  destabilizes  the  system. 

For  design  constraints,  we  will  assume  that  the  nominal  bandwidth 

and  input-output  properties  are  acceptable,  and  should  not  be  altered 

drastically.  In  particular,  the  closed  loop  bandwidth  should  not  Increase 

much,  because  if  it  does  the  system  becomes  more  susceptible  to  high 

frequency  multiplicative  perturbations . 

One  way  to  affect  the  singular  values  is  to  assume  the  (Q  ,R  ) 

o  o 

matrix  pair  reflects  the  appropriate  priorities.  Then,  the  feedback  is 
determined  by  trading  off  between  control  energy  and  speed  of  response. 

This  is  accomplished  by  introducing  a  positive  scaling  parameter  in  the  cost, 

00 

J(o)  ■  /  x^Q  x  +  pu^R  u  dt.  (4.15) 

n  0  O 
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This  method  has  been  used  for  eigenvalue  adjustments  [10] .  For 
large  values  of  p,  J(p)  approximates  the  minimim  energy  cost  functional. 

In  this  case  the  stable  open  loop  eigenvalues  remain  almost  unchanged  in 
the  closed  loop  system.  The  unstable  ones  are  reflected  symmetrically  about 
the  ju  axis.  As  p  tends  to  zero,  the  closed  loop  eigenvalues  either  tend  to 
transmission  zeros,  or  to  minus  infinity  in  a  Butterworth  pattern.  This 
result  is  a  direct  consequence  of  the  following  relation  which  is  true  for  any 
LQ  system  [11] 

(I  +  <V(jco)B)HRo(I  +  G*(jai)B)  -  Rq  +  i  [*>  (ja))B]HQ^  (juj)B.  (4.16) 


This  equation  can  also  be  used  to  derive  relationships  between  p  and  the 
singular  value  functionals. 

Lemma  4.1:  Consider  the  LQ  system  described  in  equations  (4.1)-(4.4),  where 


Q 

R 


(4.17) 


pR  . 
o 


Then,  for  each  frequency  w  for  which  ju  is  not  an  eigenvalue  of  the  open  loop 
system. 


A  (R  )  <;  lim  o,  (I  +  G(p)*B)  <  A  (R  ) 
nxxn  o  p  y fY*!  i^  mflx  o 

o 

i  ■  1,2, ... ,m 


(4.18) 


where  o,  (•)  is  the  ith  R  weighted  singular  value,  as  defined  in  equation 
iR  o 

o 

equation  (3.45). 

Proof :  Consider  equation  (4.16).  Since  these  matrices  are  equal,  their 
eigenvalues  must  be  equal, 

Xi((I  +  G(p)^B)HRo(I  +  G(p)^B))  -  vJ(Rq  +  £  kB]HQ^B)vi. 


(4.19) 
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Here,  v^  is  the  orthonormal  eigenvector  defined  by 

(RQ  +  i  ^BlHQoV>B)Vi  -  X^.  (4.20) 

In  terms  of  singular  values  and  Euclidean  norms 

a.  (I  +  G(p)v»B)2  »  H»¥"v.l|2  +  i  [^BvJ2.  (4.21) 

In  O  I  P  O  1 

Ro 

Notice  that  the  term  II  /q^Bv^I  is  bounded  for  all  p  whenever  jco  is  not  an 
eigenvalue  of  the  open  loop  system.  Taking  the  limit  on  each  side  of  the 
above  equation  and  recalling  that 

Iv  I  *  1  •  a(/R~)  <  iv'TvJ  <  a  (/r~) 
i  —  o  o  1  o 

implies  equation  (4.18).  □ 

Notice  that  when  Ro»I,  equation  (4.18)  reduces  to 

lim  a  (I  +  G(p)*B)  -  1. 

p-H»  1 

A  similar  type  of  result  is  available  when  p  tends  to  zero. 

Lemma  4.2:  Consider  the  LQ  system  described  in  the  above  lemma.  Then  for 
all  co  such  that  jco  is  not  a  transmission  zero  of  <^QvKjco)B, 

lim  a(I  +  G(p)*B)  -  «.  (4.22) 

p-*-0 

Proof ;  From  equation  (4.21), 

Og  (I  +  G(p)*B)  >  j  I  •’^Q^Bv^l  . 

<5 

Since  jco  is  not  a  transmission  zero  of  /Q-  <lB, 

o 

i/Q^Bv^  f  0. 


Consequently, 


Proof :  From  equation  (4.19) 


o.  (I  +  G(p)*B)2  -  A.  (R  [*B]HQ  *B). 

i  o  p  o 

o 


Differentiating  each  side  and  utilizing  equation  (2.56), 


do. 


kR 


2o 


i„  dp  £  *  '  -7  VBV 

Ro  p 


Where  the  independent  variable  on  the  left  side  has  been  dropped  for  brevity. 


The  vector  v^  is  defined  in  equation  (4.20).  Then, 


do. 


dp 


2o  p 
iRo 


- — r  l  VBv.I 

2  o  i 


which  implies 
do. 


dp 


<0  Vp  >  0 . 


This  implies  equation  (4.24). 


□ 


These  facts  show  that  by  scaling  Rq  by  a  parameter  p  as  in 


equation  (4.15)  the  value  of  a.  _  in  the  LQ  system  can  be  adjusted  to  any 

“A,  I 


desired  value,  as  long  as  v^^B  has  full  rank  and  has  no  transmission  zeros 

o 


on  the  ju  axis. 


This  asymptotic  technique  was  applied  to  the  nominal  system 

1 


described  in  equations  (4.5)— (4.6) .  Choosing  p*  .26, 


(Q1,R1)  -  (I4,  .26I2). 


(4.25) 


This  value  was  chosen  for  purposes  of  comparison  with  the  following 
gradient  design. 
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This  places  the  closed  loop  eigenvalues  at 

Xx  -  -.992 
X.  -  -2.32 


X-  , 

-  -11.3+11.3 

»v 

3,4 

-  •*, 

with  a 

feedback  gain  matrix 

k 

1 

■ 

5.40 

7.18  3.83 

-.563 

G„  * 

1 

_4 . 28 

8.13  2.33 

-.428 

Graphs 

of  the  important  quantities  o  . , 

°L,I, 

(4.26) 


,  and  a, 


M.1,1 


are  shown  in 


Figures  4.5,  4.6,  and  4.7.  From  Figure  4.5,  the  low  frequency  value  of 
i  is  greater  than  j  q  by  about  4dB. 

This  indicates  an  improvement  in  robustness,  disturbance  rejection 
and  sensitivity  reduction  properties.  However,  the  design  constraints  have 
been  compromised  somewhat.  From  Figure  4.6  we  see  that  the  crossover  fre¬ 
quency  has  increased  from  the  nominal  value  of  a  11  rad/sec  to  a>c  *  20 
rad/sec.  From  Figure  4.7,  the  bandwidth  has  increased  from  *  21  rad/sec  to 
<d  »  30  rad/sec.  This  shows  that  the  system  has  become  more  susceptible  to 
high  frequency  multiplicative  perturbations.  Also,  the  magnitude  of  the 
feedback  matrix  has  increased.  In  the  nominal  system. 


1 G  8  -  7.12. 
o 


(4.27) 

For  the  asymptotically  designed  system, 

(4.28) 

This  is  an  80%  Increase  in  magnitude.  It  indicates  that  more  control  energy 


IG1I  -  12.8. 


is  needed  to  attain  the  increased  value  of  a.  T 

-A, 1,1 


k.- 


IV 


SINGULAR  VALUE  PLOT 


Singular  values  of  the  return  difference  matrix  for  the  asymptotically 
designed  system. 


asymptotically  designed  system. 


SINGULAR  VALUE 


Figure  4.7.  Singular  values  of  the  inverse  Nyquist  matrix  for  the  asymptotically 
designed  system. 
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The  design  procedure  developed  in  Section  3  was  also  applied  to 
this  example  via  Theorem  3.1.  The  gradient  of  o,  T  n  was  computed  with 
respect  to  where  R  was  considered  a  constant  parameter.  At  . 1 , 

the  gradient  matrix  was  found  to  be 


0.200 

0.512 

0.313 

-0.0525 

0.512 

1.23 

0.131 

-0.0796 

0.313 

0.131 

1.19 

-0.0640 

-0.0525 

-0.0796 

-0.0640 

-0.0281 

(4. 


The  matrix  was  calculated  from  equation  (3.47)  with  e*l. 


1.80 

1.80 

1.13 

-.178 

1.80 

5.26 

.743 

-.295 

1.13 

.743 

4.91 

-.233 

-.178 

-.295 

-.233 

1.07 

R 


2 


It  was  found  by  trial  and  error  that  this  value  of  e  produced  the  greatest 

increase  of  o.  _  ».  Plots  of  a.  .  ,,  a.  _  and  ow  T  -  are  shown  in 
-A, I, 2  A, I, 2  L,I,2  M,I,2 

Figures  4.8,  4.9,  and  4.10.  The  closed  loop  eigenvalues  are  now  at 


with 


«  -.871 
X2  -  -2.46 
X3  4  -  -8.48  +  J8.51 


3.03 


03  4.31  2.64  -0.167 

-  '  ‘1  -0.181 


(4.: 


1 


1.3 


SINGULAR  VALUE  PLOT 


Singular  values  of  the  loop  transfer  matrix  for  the  system  designed 
by  the  gradient  method. 


SINGULAR  VALUE  PLOT 


Singular  values  of  the  inverse  Nyquist  matrix  for  the  system  designed 
by  the  gradient  method. 


As  before,  the  low  frequency  value  of  o  -  is  4dB  greater  than 

—A, 1 ,  / 

-A  I  0*  c^ose<^  1°°P  bandwidth  has  increased  slightly  from  the  nominal 

value  of  oig  ■  21  to  ojg  *  22  rad/sec.  The  crossover  frequency  has  remained 
approximately  the  same  as  the  nominal  value  at  uc *  11  rad/sec.  This  indicates 
that  the  tolerance  of  the  gradient  designed  system  to  high  frequency  multi¬ 
plicative  errors  has  remained  about  the  same  as  the  nominal  system. 

Comparing  the  eigenvalue  placements  of  the  three  designs,  we  see 
that  the  asymptotic  system  has  become  faster  than  the  nominal  system  in  that 
the  eigenvalues  at  -8.01  +  j 8. 24  have  moved  out  to  -11.3  + jll. 3.  The 
gradient  designed  system  shows  that  the  extra  control  energy  needed  to  push 

the  eigenvalues  to  the  left  is  unnecessary,  since  the  same  value  of  a  _ 

—A,  I 

was  achieved  in  the  gradient  design  where  the  fast  eigenvalues  moved  only 
slightly  to  the  left.  That  less  control  energy  is  needed  in  the  gradient 
designed  system  is  reflected  in  the  fact  that  the  magnitude  of  the  gain 
matrix  G ^  in  equation  (4.32)  has  increased  by  only  17%  from  the  nominal 
value  in  equation  (4.27). 

The  size  of  the  G  matrix  is  also  important  in  relative  robustness 
and  sensitivity  measurements,  where  the  norm  of  the  largest  allowable  per¬ 
turbation  is  compared  to  the  norm  of  the  nominal  plant  from  Figures  4.1  and 
4.2 


I  P.l  o(I  +  G*B) 
- L.  .  - - 2 - 

IP*II  o(G  * B) 
o 


-A, 1,0 


L,I,0 


0)  <  .1. 


(4.33) 


Hence,  the  nominal  system  could  be  subjected  to  a  low  frequency  (uk.I) 
additive  perturbation  with  a  magnitude  83%  as  large  as  the  nominal  system 


itself,  and  remain  stable.  Also 


°L,I,0 


.76 


a>  <  .1 


(4.34 


and  so  the  nominal  system  would  be  insensitive  to  perturbations  with  up  to 
76%  magnitude  variation.  In  the  asymptotically  designed  system. 


£a,i,i 

?L,I.l 


.59 


For  this  last  design. 


-A  I  2 

*  1.33  , 

°L,I,2 


~A, 1 ,1  ~  1 
aL,I,l 


.34, 


0)  <  .1. 


(4.35 


-A^I,2 

ffL,I,2 


.76 


id  <  .1. 


(4.36 


Notice  that  in  each  case  above,  the  magnitude  of  the  nominal  plant  (o  _  .) 

L9  I  9  1 

is  different  because  the  matrix  is  part  of  it.  Because  of  this,  the 
asymptotic  technique  actually  decreased  the  relative  robustness  and  sensi¬ 
tivity  tolerances.  Since  I1G-1I  and  hence  a  _  .  did  not  increase  much  in  the 

Z  L91 9  z 

last  design,  the  relative  robustness  margin  was  increased  from  the  nominal 
value . 

The  fact  that  II  Go0  did  not  increase  much  over  0GB  is  also 
l  o 

reflected  by  the  low  frequency  value  of  a  _  ,,  which  is  approximately  the 

A9  1  9  4 

same  as  3.  .  ..  This  indicates  an  important  feature  of  the  design 
technique  of  Section  3  -  the  motion  of  the  singular  values  can  be  separated. 
If  it  was  necessary  to  increase  the  value  of  a  T  n,  it  could  also  be  done 
with  this  technique.  Hence  the  designer  has  the  option  of  manipulating 
each  system  singular  value  Individually. 


5 .  CONCLUSION 


In  Section  3  we  saw  how  important  closed  loop  properties  such  as 
robustness  and  sensitivity  translate  into  requirements  on  the  singular 
values  of  the  return  difference  and  inverse  Nyquist  matrices.  'This  in  turn 
imposes  requirements  on  the  singular  values  of  the  loop  transfer  matrix. 

Thus  a  major  objective  in  design  is  to  synthesize  a  controller  that  adjusts 
the  frequency  shapes  of  the  singular  values  of  the  loop  transfer  matrix  to 
meet  the  requirements  imposed  by  the  design  criteria.  If  the  mathematical 
structure  of  the  controller  is  predetermined,  it  is  possible  to  calculate 
the  gradient  of  the  singular  value  functional  with  respect  to  the  adjustable 
parameters  of  the  controller.  The  gradient  vector  gives  the  direction  to 
change  the  control  parameter  vector  (whose  elements  are  the  adjustable 
parameters)  that  will  have  the  greatest  possible  effect  on  a  given  singular 
value.  Since  the  control  parameter  vector  is  assumed  to  be  constant,  a 
fixed  frequency  must  be  substituted  into  the  gradient  formula.  Then,  we  are 
guaranteed  only  that  the  singular  value  functional  will  be  affected  in  some 
neighborhood  of  that  frequency.  To  adjust  the  entire  singular  value  curve  as 
a  function  of  frequency,  the  gradient  must  be  Iteratively  calculated  at 
different  frequencies,  and  the  control  parameter  vector  adjusted  each  time. 

The  linear  quadratic  optimal  control  problem  can  be  used  as  a 
method  to  obtain  a  full  state  feedback  controller  for  a  general  design 
problem.  This  method  has  the  advantage  of  being  a  systematic,  numerically 
feasible  way  to  choose  a  feedback  gain  matrix  that  has  certain  minimum 
guaranteed  robustness  and  sensitivity  margins.  However,  the  design  problem 
is  now  translated  into  how  to  choose  the  weighting  coefficients  (the  Q,R 
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matrices) .  The  dependence  of  the  system  singular  values  on  these  matrices 
is  certainly  nonobvious.  Since  the  mathematical  structure  of  the  controller 
is  known,  the  gradient  of  these  singular  value  functionals  with  respect  to 
the  Q,R  matrix  pair  can  be  calculated.  This  provides  the  designer  with  a 
tool  to  adjust  the  Q,R  matrix  pair  to  bend  the  singular  values  (as  functions 
of  frequency)  into  a  more  desirable  shape. 

The  new  parameter  vector  (or  Q,R  pair,  in  this  case)  is  formed  by 
adding  the  gradient  times  a  scalar  to  the  old  parameter  vector.  In  the 
optimal  gradient  method,  the  value  of  the  scalar  is  determined  by  considering 
the  cost  functional  as  a  function  of  the  scalar  and  setting  its  derivative 
equal  to  zero.  The  complexity  of  the  relationship  between  the  singular  values 
and  the  scalar  parameter  prevent  us  from  doing  this  in  the  LQ  problem.  Thus 
the  best  value  of  the  scalar  must  be  determined  by  trial  and  error  in  this 
design  procedure. 

In  Theorem  3.4,  it  was  necessary  to  assume  that  the  matrix  B  had 
rank  n  to  insure  the  existence  of  the  inverse  of  Tq  ■  QBG  (i.e.  the  system 
had  at  least  as  many  inputs  as  it  did  states) .  This  requirement  is  not 
likely  to  be  satisfied  in  practice.  It  may  be  possible  to  relax  this 
assumption  by  using  the  pseudoinverse.  This  possibility  is  currently  under 
investigation. 

The  LQ  design  presumes  that  full  state  feedback  is  available. 

In  practice,  this  is  often  not  the  case,  and  the  missing  states  must  be 
created  by  an  observer.  Unfortunately,  when  an  observer  is  inserted  in 
the  LQ  loop,  all  robustness  and  sensitivity  guarantees  are  lost.  This 
makes  the  issue  of  how  to  adjust  the  LQ  feedback  matrix  to  increase  robust¬ 
ness  and  sensitivity  properties  much  more  important.  The  calculation  of 


the  gradient  with  an  observer  in  the  loop  is  a  straightforward  extension 
of  the  formulas  for  the  full  state  feedback  case.  However,  the  effective¬ 
ness  of  these  formulas  remains  to  be  experimentally  verified. 

When  white,  uncorrelated  Gaussian  noise  signals  are  inserted  into 
the  system  and  state  measurement  equations  of  the  LQ  regulator,  the  optimal 
solution  is  the  linear-quadratic-Gaussian  (LQG)  controller.  The  controller 
is  just  the  LQ  feedback  matrix  plus  an  observer  whose  gains  depend  upon  a 
dual  Riccati  equation  [23] .  The  dual  equation  is  a  function  of  the  noise 
covariance  matrices.  It  is  possible  to  calculate  the  gradient  of  the 
singular  value  functionals  witn  respect  to  these  matrices  using  the  same 
type  of  derivation  as  in  Section  3.  This  could  be  useful  it  the  dominant 
LQ  poles  are  fixed  by  some  primary  design  constraint.  The  effect  of 
altering  the  noise  covariance  matrices  would  be  that  the  observer  would  no 
longer  provide  optimal  noise  reduction,  hence,  the  robustness  and  sensi¬ 
tivity  properties  of  the  LQG  system  could  be  affected  at  the  expense  of 
noise  performance. 

In  Section  4  the  LQ  adjustment  procedure  was  applied  to  an  example 
to  illustrate  its  effectiveness.  In  this  example,  the  additive  perturbation 


was  assumed  to  surround  the  plant  and  the  controller.  This  assumption  was 
made  because  it  makes  the  singular  value  bound  simpler.  If  the  perturbation 
added  directly  to  the  nominal  plant,  we  would  have  had  to  take  the  gradient 
of  the  singular  value  ratio 


o(I  +  G  B) 
5(G) 


(5.1) 


(see  equation  (2.19)).  This  would  complicate  the  gradient  calculation  quite 
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a  bit.  Also,  this  bound  (5.1)  is  conservative,  which  makes  it  less 
meaningful.  However,  the  example  showed  that  it  was  possible  to  increase 
a(a,i,o)  for  w<  .1  while  keeping  5(G)  relatively  constant  just  using  the 
gradient  of  a  (i,o)  since  this  gradient  decoupled  the  motion  of  the  singular 
values.  Thus,  a  more  complicated  gradient  calculation  of  equation  (5.1) 
could  be  avoided.  Note  that  this  problem  does  not  occur  in  the  case  of 
multiplicative  perturbations  -  the  frequency  plot  of  o(m,*,o)  provides  a 
direct  nonconservative  measure  of  the  desired  properties. 
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